SECOND EDITION 



James D. Watson 

COLD SPRING HARBOR LABORATORY 

Michael Gilman 

COLD SPRING HARBOR LABORATORY 

Jan Witkowski 

BANBURY CENTER, COLD SPRING HARBOR LABORATORY 

Mark Zoller

GENENTECH, INC.



SCIENTIFIC 
AMERICAN 

BOOKS 

Distributed by

W. H. Freeman and Company 
New York 



The double helix on the cover symbolizes some of the elements of the book. The blocks are double-stranded DNA fragments synthesized by the polymerase chain reaction (see Chapter 6). The coat colors of the mice running down the helix (in the same direction but with opposite polarity!) are changing from albino to chimeric, then chimeric to agouti. These coat color changes show mice in which genetic engineering has been used to knock out a specific gene. The experiment is shown more realistically in Figure

Library of Congress Cataloging-in-Publication Data

Recombinant DNA / James D. Watson ... [et al.]. — 2nd ed.
p. cm.

Includes bibliographical references and index.

ISBN 0-7167-1994-0. — ISBN 0-7167-2282-8 (pbk.) 

1. Recombinant DNA. I. Watson, James D., 1928-
QH442.R37 1992

574.87'3282 — dc20 

91-38483 
CIP 

Copyright © 1983 by James D. Watson, John Tooze, and David T. Kurtz
Copyright © 1992 by James D. Watson, Michael Gilman, Jan Witkowski, and Mark Zoller

No part of this book may be reproduced by any mechanical, photographic, or electronic process, or in the form of a phonographic recording, nor may it be stored in a retrieval system, transmitted, or otherwise copied for public or private use, without written permission from the publisher.

Printed in the United States of America 

Scientific American Books is a subsidiary of Scientific American, Inc.

Distributed by W. H. Freeman and Company, 41 Madison Avenue, New York,
New York 10010

1 2 3 4 5 6 7 8 9 0 RRD 9 9 8 7 6 5 4 3? 



Contents 



Preface 

Acknowledgments 



Development of Recombinant DNA Technology 

1 ESTABLISHING THE ROLE OF GENES WITHIN CELLS 

The Building Blocks of All Life Are Cells 

Cells Are Tiny Expandable Factories That Simultaneously 

Synthesize Several Thousand Different Molecules 
A Cell's Molecules Can Be Divided into Small Molecules and 

Macromolecules

Special Cellular Catalysts Called Enzymes Effectively Determine the 

Chemical Reactions That Occur in Cells 
A Given Protein Possesses a Unique Sequence of Amino Acids 

Along Its Polypeptide Chain 
The Functioning of an Enzyme Demands a Precise Folding of Its 

Polypeptide Chain 
Activation of Molecules to High-Energy Forms Promotes Their 

Chemical Reactivity 
Cellular Metabolism Can Be Visualized Through Metabolic Maps 
Enzymes Cannot Determine the Order of Amino Acids in Polypeptide

Chains 






Mendel's Experiments with Pea Plants First 
Revealed the Discreteness of Genetic 
Determinants (Genes) 

Chromosomes Are the Cellular Bearers of 
Heredity 

The One Gene, One Protein Hypothesis Is 
Developed 



2 DNA IS THE PRIMARY GENETIC MATERIAL



DNA Is Sited on Chromosomes 
Cells Contain RNA as well as DNA 
A Biological Assay for Genetic Molecules Is 
Discovered 

Viruses Are Packaged Genetic Elements That 

Move from Cell to Cell 
Molecules with Complementary Sizes and 

Shapes Attract One Another 
The Diameter of DNA Is Established 
The Nucleotides of DNA and RNA Are 
Linked Together by Regular 5'-3' 
Phosphodiester Bonds 
The Composition of Bases of DNA from 

Different Organisms Varies Greatly
DNA Has a Highly Regular Shape
The Fundamental Unit of DNA Consists of
Two Intertwined Polynucleotide Chains
(the Double Helix) 
The Double Helix Is Held Together by
Hydrogen Bonds Between Base Pairs 
The Complementary Nature of DNA Is at the 

Heart of Its Capacity for Self-Replication 
DNA Replication Is Found to Be 

Semiconservative 
DNA Molecules Can Be Renatured as well as 
Denatured 

G·C Base Pairs Fall Apart Less Easily Than

Their A·T Equivalents
Palindromes Promote Intrastrand Hydrogen 

Bonding 

5-Methylcytosine Can Replace Cytosine in
DNA

Chromosomes Contain Single DNA Molecules 
Viruses Are Sources of Homogeneous DNA 
Molecules 

Phage λ DNA Can Insert into a Specific Site

Along the E. coli Chromosome
Abnormal Transducing Phages Provide Unique

Segments of Bacterial Chromosomes
Plasmids Are Autonomously Replicating

Minichromosomes
Circular DNA Molecules May Be Supercoiled 






Most Double Helices Are Right-Handed, 
but Under Special Conditions Certain 
DNA Nucleotide Sequences Lead to 
Left-Handed Helices 



A Mutation in the Hemoglobin Molecule

Is Traced to a Single Amino Acid 

Replacement 
The Use of E. coli Leads to the Rapid

Development of Fine-Structure Genetics
The Gene and Its Polypeptide Products Are

Colinear

RNA Carries Information from DNA to the 

Cytoplasmic Sites of Protein Synthesis 
How Do Amino Acids Line Up on RNA 

Templates? 
Roles of Enzymes and Templates in the 

Synthesis of Nucleic Acids and Proteins 
Proteins Are Synthesized from the Amino 

Terminus to the Carboxyl Terminus 
Three Forms of RNA Are Involved in Protein 

Synthesis 

Genetic Evidence Reveals That Codons 

Contain Three Bases 
RNA Chains Are Both Synthesized and 

Translated in a 5'-to-3' Direction 
Synthetic mRNA Is Used to Make the Codon 

Assignments 
The Genetic Code Is Fully Deciphered by 
June 1966 

"Wobble" Frequently Permits Single tRNA 
Species to Recognize Multiple Codons 

How Universal Is the Genetic Code? 

Average-Sized Genes Contain at Least 1200
Base Pairs

Mutations Change the Base Sequence of DNA
Suppressor tRNAs Cause Misreading of the

Genetic Code 
The Signals for Starting and Stopping the 

Synthesis of Specific RNA Molecules Are 

Encoded Within DNA Sequences 
Increasingly Accurate Systems Are Developed 

for the in Vitro Translation of 

Exogenously Added mRNAs



4 THE GENETIC ELEMENTS THAT CONTROL GENE EXPRESSION

Repressors Control Inducible Enzyme 
Synthesis 






Bacterial Genes with Related Functions Are 

Organized into Operons 
Promoters Are the Start Signals for RNA

Synthesis 

Repressor Molecules Are Normally Made at 

Constant Rates 
Repressors Are Isolated and Identified 
Positive Regulation of Gene Transcription Also 

Occurs 

Attenuation Is Another Form of Regulation 
Translational Control Is the Second Means of

Controlling Protein Synthesis 
Probing Gene Regulation in Higher Plants and 

Animals Presented Early Difficulties 
Purified Xenopus Ribosomal RNA Genes Are 

Isolated 

Eukaryotic mRNAs Have Caps and Tails
Eukaryotes Are Found to Have Three Kinds of

RNA Polymerases
Eukaryotic DNA Is Organized into

Nucleosomes 
Animal Viruses Are Model Systems for Gene 

Expression in Higher Cells 
The RNA Tumor Viruses Replicate by Means 

of a Double-Stranded DNA Intermediate 



METHODS OF CREATING 
RECOMBINANT DNA MOLECULES



Nucleic Acid Sequencing Methods Are 
Developed 

Restriction Enzymes Make Sequence-Specific 

Cuts in DNA 
Restriction Maps Are Highly Specific 
Restriction Fragments Lead to Powerful New 

Methods for Sequencing DNA 
Oligonucleotides Can Be Synthesized 

Chemically 

Many Restriction Enzymes Produce Fragments 
Containing Sticky (Cohesive) Ends 

Many Enzymes Are Involved in DNA 
Replication 

Sticky Ends May Be Enzymatically Added to 

Blunt-Ended DNA Molecules 
Small Plasmids Are Vectors for the Cloning of 

Foreign Genes 
DNA of Higher Organisms Becomes Open to 

Molecular Analysis 
Scientists Voice Concerns About the Dangers 

of Unrestricted Gene Cloning 
Guidelines for Recombinant DNA Research 

Are Proposed at the Asilomar Conference 
Recombinant DNA Comes of Age 






Analysis of Cloned Genes 



The Polymerase Chain Reaction Amplifies 

Specific Regions of DNA 
Performing a Polymerase Chain Reaction
Taq Polymerase Simplifies and Improves the 

PCR 

Fidelity of DNA Synthesis by Taq Polymerase 
Determines the Accuracy of PCR 
Amplification 

DNA for the PCR Comes from a Variety of 
Sources 

PCR Is Used to Amplify Human-Specific DNA
Sequences 

PCR Products Can Be Sequenced Directly 
Detecting Mutations Using PCR Amplification 
PCR Amplification Is Used for Monitoring 

Cancer Therapy 
PCR Amplification Is Used to Detect Bacterial 

and Viral Infections 
PCR Amplification Is Used for Sex

Determination of Prenatal Cells 
PCR Methods Permit Linkage Analysis Using 

Single Sperm Cells 
PCR Techniques Are Used in Studies of 

Molecular Evolution 
Contamination Can Be a Problem in PCR 

Studies 

The Polymerase Chain Reaction — A Technical 
Revolution in Molecular Genetics 



7 THE ISOLATION OF CLONED GENES



Improved Bacteria and Vectors Are Developed 
Basic Strategies for Cloning Involve Three 
Steps 

Choosing the Right Starting Material Is 

Essential in Cloning 
mRNA Is Converted to cDNA by Enzymatic 

Reactions 

cDNA Molecules Are Joined to Vector DNA 

for Propagation in Bacteria 
Nucleic Acid Probes Are Used to Locate 

Clones Carrying a Desired DNA Sequence 
Synthetic Oligonucleotide Probes Can Be 

Designed from Known Protein Sequence 
Tissue-Specific cDNAs Are Identified by 

Differential Hybridization 
Gene Probes from Conserved Segments in 

Protein Families Identify New Related 

Genes 






Expression Vectors May Be Used to Isolate 

Specific cDNAs 
Cloned Genes Can Be Isolated by Functional 

Assay in E. coli
Cloned Genes Can Be Isolated by Functional

Assay in Eukaryotic Cells 
Cloned DNA Is Analyzed by DNA Sequencing 
Computers Have Simplified Translating DNA 

Sequence into Protein Sequence 
Searching Sequence Databases to Identify

Proteins and Protein Functions
Several Procedures Exist to Analyze Proteins

Encoded by cDNA Clones
Genomic Fragments Are Cloned in 

Bacteriophage 
Cosmids Allow the Cloning of Large Segments 

of Genomic DNA 
Chromosome Walking Is Used to Analyze 

Long Stretches of Eukaryotic DNA 
Southern and Northern Blotting Procedures 

Analyze DNA and RNA 



THE COMPLEXITY OF THE GENOME 

Split Genes Are Discovered 

Introns Are Discovered in Eukaryotic Genes

Specific Base Sequences Are Found at

Exon-Intron Boundaries
Alternative Splicing Pathways Generate 

Different mRNAs from a Single Gene 
Introns Sometimes Mark Functional Protein 

Domains 

Transcriptional Control Regions Occur 

Throughout Eukaryotic Genes 
RNA Polymerase III Transcription Is 

Regulated by Sequences in the Middle of

the Gene 

Genes Encoding Abundant Products Are Often 

Tandemly Repeated 
Clustered Globin Genes Exhibit Coordinated 

Expression in Development 
Pseudogenes Arise by Duplication of an Active

Gene and Accumulate Mutations During 

Evolution 
Short Repetitive DNA Sequences Are 

Dispersed Throughout the Eukaryotic 

Genome 

Transcriptionally Inactive Regions of the 
Genome Are Often Found to Be 
Methylated 

Most Genes Are Much Larger Than Their 
mRNA 






Genes Are Sometimes Encoded Within Genes 
Summary 






r tt/" 



Common Sequence Motifs Are Recognized in

5'-Flanking Regions
Enhancers Activate Gene Expression over Long

Distances 
Enhancers Can Be Tissue-Specific or 

Regulated by Signals 
Enhancers Contain Recognition Sites for DNA- 

Binding Proteins 
Genes Can Be Transcribed in Cell-Free 

Extracts 

Gene-Specific Transcription Factors Are DNA

Binding Proteins 
Transcription Factors Are Purified and Cloned 
Transcription Factors Fall into Structural 

Families 

Transcription Factors Are Modular 
How Do Transcription Factors Work? 
The Cellular Splicing Machinery Is 

Extraordinarily Selective and Precise 
Synthetic Pre-mRNAs Are Spliced in Oocytes

and Cell-Free Extracts 
Cell-Free Extracts Are Fractionated to Identify 

Splicing Factors 
Trans-Acting Splicing Factors Govern

Alternative Splicing 



10 MOVABLE GENES

Sequencing Reveals the Organization of

Bacterial Transposable Elements
Some Transposable Elements in Eukaryotes

Resemble Bacterial Transposons
Ty Elements in Yeast Are a Different Class of

Transposable Elements 
Repetitive Elements and Processed 

Pseudogenes Are Remnants of 

Transposition Events 
Bacterial Transposons Jump via DNA 

Intermediates 
Ty Elements Transpose via an RNA 

Intermediate 
Mating Type Interconversion in Yeast Occurs

by Replicative Transposition 
Functional Immunoglobulin Genes Are Formed 

by Ordered Gene Rearrangements 
Gene Rearrangements Sometimes Go Awry 






Transposable Elements Are Potent Tools for
Identifying and Manipulating Genes



New Tools for Studying 
Gene Function 

11 IN VITRO MUTAGENESIS






In Vitro Mutagenesis Is Used to Study Gene
Function

Restriction Endonuclease Sites Provide the
Simplest Access for Mutagenesis

Linker Insertion Is Used to Map a Bacterial
Transposon

Construction of Nested Deletions Maps the
Boundaries of a Transcriptional Control
Region

Linker-Scanning Mutagenesis Permits

Systematic Analysis of Promoters 
Random Nucleotide Substitutions Are 

Obtained by Chemical Modification of 

DNA or by Enzymatic Misincorporation 
Synthetic Oligonucleotides Facilitate 

Mutagenesis 
Mutant Clones Can Be Identified by 

Hybridization and DNA Sequencing 
Oligonucleotide Cassettes Provide a Simple 

Method for Introducing Directed 

Mutations 

Gene Synthesis Facilitates Production of 

Normal and Mutant Proteins 
The PCR Can Be Used to Construct Genes

Encoding Chimeric Proteins
Mutagenesis Is the Gateway to Gene Function 

and Protein Engineering 

12 TRANSFERRING GENES INTO
MAMMALIAN CELLS



Establishment of Immortal Cell Lines Makes 

Gene Transfer Practical
Gene Transfer Was First Used in the Study of

Tumor Viruses 
Selectable Markers That Work in Mammalian

Cells Allow Gene Transfer by

Cotransformation
Exogenous DNA Is Transiently Expressed in

Many Cells Immediately Following 

Transfection 
Gene Amplification Is Used to Achieve High-
Level Protein Expression






Specialized Methods Are Developed to

Transfect Difficult Cell Types
Viral Vectors Introduce Foreign DNA into

Cells with High Efficiency
Vaccinia and Baculovirus Are Used for High-
Level Protein Production
Retroviruses Provide High-Efficiency Vectors 

for Stable Gene Transfer 
The Experimenter Has Little Control over the 

Fate of Transferred DNA 
Antisense RNA and DNA Can Extinguish 

Gene Function 
Homologous Recombination Is Used to 

Inactivate Cellular Genes 

13 USING YEAST TO STUDY 
EUKARYOTIC GENE FUNCTION 

Yeast Biosynthetic Genes Are Cloned by 
Complementation of E. coli Mutations

Shuttle Vectors Replicate in Both E. coli and
Yeast 

Yeast Genes Can Be Cloned by Simple 

Complementation Strategies 
Homologous Recombination Is a Relatively 

Frequent Event in Yeast
Cloning Genes Required for Mating Reveals a

Signaling Pathway Similar to That Seen in

Higher Organisms 
Genetic Experiments in Yeast Can Answer 

Precise Biochemical Questions 
Genetic Analysis in Yeast Can Be Exploited to 

Identify and Study Genes from Higher 

Organisms 

14 THE INTRODUCTION OF FOREIGN
GENES INTO MICE 

Foreign Genes Become Integrated in the 
Chromosomes of Recipient Animals 

Foreign DNA Can Become Stably Integrated 
Into Germ Line Cells 

Embryonic Stem Cells Can Carry Foreign 
Genes

Transgenes Can Be Regulated in a Tissue- 
Specific Manner 

Transgene Expression Can Be Targeted to 
Specific Tissues 

Transgenes Can Be Used to Kill Specific Cell 
Types 

Retroviruses Can Be Used to Trace Cell 
Lineages 

Transgenes Can Disrupt the Functioning of 
Endogenous Genes 






Knocking Out Genes by Homologous 

Recombination Can Elucidate Complex 

Gene Systems 
Studying Genetic Control of Mouse 

Development 
Transgenes Can Be Used to Detect Host 

Genes 

A Single Gene Turns Female Mice into Males 
Transgenic Mice Provide Models of Human 
Diseases 

Imprinting — Males and Females Make 

Unequal Genetic Contributions to Their 
Offspring 



; \J"h" ) -X 



,NK 



Plants Have Advantages and Disadvantages for 

Genetic Engineering 
Whole Plants Can Be Grown from Single Cells 
Leaf Disks Are an Important Target for Gene 

Transfer 

Ti Plasmid of Agrobacterium Causes Crown Gall
Tumors 

T-DNA, Part of the Ti Plasmid, Is Transferred 
to Plant Cells 

T-DNA Has Been Modified to Act as a Gene 
Vector 

Reporter Genes Demonstrate Transgene

Expression in Plant Tissues 
Viruses Can Be Used as Vectors for Whole 

Plants 

Guns and Electric Shocks Transfer DNA into
Plant Cells 

Bombardment with DNA-Coated Beads Can 

Produce Transgenic Organelles 
Plant Genes Can Be Cloned by Using 

Transposable Elements 
T-DNA Is Used as an Insertion Mutagen 
Arabidopsis Is Being Used as a Model Organism

for Molecular Genetic Analysis of Plants 






Analysis of Important Biological 
Processes by Using Recombinant DNA 



MOLECULES OF IMMUNE RECOGNITION

The Basic Structure of Antibody Molecules Is
Established

Fledgling Recombinant DNA Technology 
Verifies the Dreyer and Bennett 
Hypothesis 






Rearrangement of Antibody Genes Generates 
Additional Diversity at the V-C Junction
Class Switching Places Useful Recognition 
Specificities on Antibody Molecules with 
Different Functional Properties 
The Mechanisms of Antibody Gene 
Rearrangement Can Be Studied by 
Introducing Artificial Genes into Cells 
The Study of Cellular Immunity Was Greatly

Advanced by Gene Cloning
T Cells Recognize Antigens Only on Cells

from the Same Individual
Cloning of the MHC Genes Reveals That Self
Identity Is Determined by a Few
Polymorphic Genes
T-Cell Receptors Recognize Antigens Only in

Association with an MHC Molecule 
Intercellular Communication Regulates 

Immune System Function 
The Immunoglobulin Superfamily Encodes
Proteins That Participate in Cell-Cell 
Communication 
Malfunctions of the Immune System Underlie 
Many Diseases 

17 MOVING SIGNALS ACROSS
MEMBRANES

The Acetylcholine Receptor Is a Ligand-Gated

Ion Channel
Some Receptors Are Coupled to Second

Messenger Systems via GTP-Binding 

Proteins 

G Protein-Linked Receptors Span the 

Membrane Seven Times 
Domain Swapping Reveals the Structural Basis 

of Receptor-Effector Coupling 
Visual Pigments Are Signaling Receptors 
Growth Factor Receptors Have Intrinsic 

Enzymatic Activity 
Receptors Can Be Associated with Intracellular 

Protein Kinases 
Receptors Activate Common Second 

Messenger Systems 
Protein Phosphorylation Is a Principal 

Mechanism for Signaling 
Steroid Receptors Are Transcription Factors 
cAMP Signals Reach the Nucleus via 

Transcription Factor Phosphorylation 
Studying Target Genes Reveals the 

Organization of Signal Transduction 
Pathways 

Immediate Early Genes Are Third Messengers 






Cancer Is a Genetic Disease 

Tumor Cells Have Aberrant Growth Properties

in Cell Culture 
Tumor Viruses Opened the Study of Cancer to 

Molecular Methods 
Retroviral Oncogenes Are Captured from 

Cellular DNA 
Viruses Can Activate Cellular Proto-

Oncogenes by Insertional Mutagenesis
Analysis of Tumor Chromosomes Reveals

Rearrangements of Proto-Oncogenes
The Transformed Phenotype Can Be Passed

via DNA-Mediated Transfection
An Activated Human Oncogene Is Cloned 
The Human Bladder Carcinoma Oncogene Is 

an Activated ras Gene 
The Bladder Carcinoma ras Oncogene Is 

Activated by Point Mutation 
Proto-Oncogenes Encode Components of 

Signal Transduction Pathways 
Proto-Oncogenes Can Encode Growth Factors 

or Their Receptors 
Many Proto-Oncogenes Act as Intracellular

Signal Transducers 
Some Proto-Oncogenes Are Transcription 

Factors 

Experimentally Modeling the Complexities of 

Cancer: Multiple Hits 
Oncogenes Cause Cancer in Transgenic Mice 
The Fusion Paradox: Normal Growth Is

Dominant to Transformation
Susceptibility to Cancer Can Be Inherited
The Cloning of the Retinoblastoma 

Susceptibility Gene 
The RB Protein Is the Target of DNA Tumor 

Virus Oncogenes 
The Strange Case of p53: An Oncogene 

Crosses to the Other Side
The Several Steps to Colorectal Cancer: 

A Real Life Tale of Oncogenes and 

Anti-Oncogenes 
Cancer Results from Accumulation of 

Dominant and Recessive Mutations 

19 MOLECULAR ANALYSIS OF THE CELL CYCLE



There Are Four Stages in the Cell Cycle 
There Are Two Types of Control of Cell 
Division 

Yeasts Are Good Systems for Studying Cell 
Cycle Control 






Temperature-Sensitive Mutants Are a Valuable

Tool in Studying the Cell Division Cycle
The CDC28 Gene Encodes a Ubiquitous

Protein Kinase
The Protein Kinase Activity of the Cdc2

Protein Varies with the Cell Cycle 
MPF Contains Cdc2 
Cyclins Are Cloned

Cyclin and Cdc2 Form a Protein Complex
Cyclin Destruction Is Required to Inactivate

the Kinase Activity
A Different Set of Cyclins Regulates Cdc28 in

G1 Phase

Other Cdc Proteins Are Also Involved in 
Regulating Cdc2 Kinase Activity 

Recombinant DNA Provides a Common 
Language for Different Experimental 
Systems 



20 GENES THAT CONTROL THE
DEVELOPMENT OF DROSOPHILA



Serendipity and Systematic Genetic Screens 

Identify Genes That Control Development 
The First Drosophila Development Genes Were 

Cloned by Chromosome Walking 
The Homeodomain Is Shared among Drosophila 

Developmental Genes 
Anterior-Posterior Polarity Arises from a 

Gradient of a Homeodomain Protein 
Gap Genes Define the First Subdivisions of the

Embryo 

Segmentation Genes Divide the Embryo into 
Stripes 

The bicoid Protein Directly Regulates the 

Transcription of hunchback 
Expression in Stripes Is Encoded by Discrete 

Cis-Acting Regulatory Elements 
The Specificity of Homeotic Genes Remains 

Unexplained 
Formation of the Embryo's Ends Requires a 

Protein Kinase-Dependent Signal

Transduction Pathway
Dorsal-Ventral Polarity Is Achieved by

Regulated Transport of a Transcription 

Factor to the Nucleus 
Drosophila Developmental Genes Help Isolate 

New Vertebrate Developmental Genes 



PA TIM 



Genes Specifying the Development of Neurons 
Are Cloned 






Neurotrophic Factors Stimulate Neuron 

Growth and Differentiation 
Retroviruses Are Used to Mark and 

Immortalize Neurons 
Cloning and Mutagenesis Establish Structural 

Models of the Voltage-Gated Ion Channels 
Neurotransmitter Receptors Are Members of 

Large Gene Families 
Homology Cloning Is Used to Isolate

Components of Signal Transduction 

Pathways in Olfactory Neurons 
Learning and Memory Require Stable Changes 

in Neuron Function
Molecular Cloning and Gene Mapping Begin 

to Explore Alzheimer's Disease 



22 RECOMBINANT DNA AND EVOLUTION

Life May Have Originated in an RNA World

DNA— Why So Much? 

Genes Can Be Turned On (and Off) by 

Movable Elements 
Introns Are Ancient Components of Genes
Exon Shuffling Contributes to Gene Evolution
Gene Duplication Is a Driving Force in

Evolution 

DNA Clocks Measure Rates of Evolution 
rRNA Sequencing Identifies the Three Great 

Kingdoms of Living Organisms 
Recombinant DNA Techniques Have Sorted 

Out Relationships in the Primate Family 
Mitochondrial DNA as a Molecular Clock 
Some Intracellular Organelles Were Once 
Bacteria

DNA Fingerprinting Helps Us Understand the 
Genetic Basis of Altruism 



Application of Recombinant DNA 
to Biotechnology 

23 RECOMBINANT DNA IN MEDICINE 
AND INDUSTRY 



Expression Systems Are Developed to Produce

Recombinant Proteins 
Insulin Is the First Recombinant Drug 

Licensed for Human Use
Recombinant Human Growth Hormone Is

Produced in Bacteria by Two Methods 
A Hepatitis B Virus Vaccine Is Produced in 

Yeast by Expression of a Viral Surface 

Antigen 






Complex Human Proteins Are Produced by
Large-Scale Mammalian Cell Culture

Monoclonal Antibodies Function as "Magic
Bullets"

Human Antibodies That Recognize Specific 
Antigens Can Be Directly Cloned and 

Selected 

"Humanized" Monoclonal Antibodies Retain 

Activity But Lose Immunogenicity
Protein Engineering Can Tailor Antibodies for

Specific Applications
Protein Engineering Is Used to Improve a

Detergent Enzyme
Growth Hormone Variants with Improved 

Binding Are Selected by Phage Display 
New Technologies Promise New Approaches 

to Drug Design 

24 GENERATION OF AGRICULTURALLY 
IMPORTANT PLANTS AND ANIMALS 

Plants Expressing a Viral Coat Protein Resist 
Infection 

Insects Fail to Prey on Plants Expressing a

Bacterial Toxin
Herbicide-Tolerant Transgenic Plants Allow

More Effective Management of Weeds
Flowers Exhibiting New Colors and Patterns

Can Be Obtained by Genetic Engineering 
The Potential Use of Plants to Produce 

Proteins Is of Commercial Importance 
Recombinant Bovine Growth Hormone 

Stimulates Milk Production and Improves 

Feed Utilization 
Procedures for Generation of Transgenic Farm 

Animals Are Developed 
Pharmaceutical Proteins Can Be Produced in

Transgenic Animals
Farm Animals May Be Protected from Viral 

Infection by Transgenic Expression of 

Viral Coat Protein 
The Implementation of Agricultural 

Biotechnology Requires Continued 

Research and Social Discussion 



25 MARSHALING RECOMBINANT DNA
TO FIGHT AIDS



Human Immunodeficiency Virus Is the Cause 

of AIDS 
HIV Is a Retrovirus 

HIV Belongs to the Most Complex Class of 
Retroviruses Yet Discovered 






The tat Gene Regulates Synthesis of HIV RNA 
Movement of HIV RNA from Nucleus to 

Cytoplasm Is Regulated by Rev 
The nef Gene May Be Essential for in Vivo 

Replication of Pathogenic AIDS 
AZT Works by Interfering with Viral DNA

Synthesis 

HIV Protease Is a Target for AIDS Drug 
Therapy 

HIV Infects and Kills T Lymphocytes That 

Have the CD4 Receptor 
Soluble CD4 Molecules Can Be Used to 

Prevent HIV Infection 
CD4-Toxin Conjugates Specifically Kill

HIV-infected Cells 
Transport of HIV Proteins to the Cell Surface 

Can Be Inhibited 
Simian Immunodeficiency Virus and Animal

Models Are Useful in Studying AIDS 
Recombinant HIV Proteins May Be Effective 

as Immunogens for AIDS Vaccines 
Kaposi's Sarcoma Is a Tumor Associated with 

AIDS 

The Origins and Evolution of the Human 
Immunodeficiency Viruses Are Revealed
Through Recombinant Techniques

Recombinant DNA Is at the Forefront of the
Battle Against AIDS 



Impact of Recombinant DNA on
Human Genetics



26 MAPPING AND CLONING HUMAN
DISEASE GENES



Human Genetic Diseases Have Simple and 

Complex Patterns of Inheritance 
The Metabolic Basis Is Known for Some 

Human Inherited Diseases 
"Positional Cloning" Uses the Location of a 

Gene on a Chromosome to Clone the 

Gene 

Subchromosomal Mapping of Genes and 

Markers Can Be Accomplished with 

Somatic Cell Hybrids 
Cloned Genes and Markers Can Be Localized 

by In Situ Hybridization to Chromosomes 
Chromosomal Abnormalities Provide Another 

Means of Locating Disease Genes 
Restriction Fragment Length Polymorphisms 

Serve as Markers for Linkage Analysis 






Linkage Is Calculated from Frequency of

Recombination 
Abnormal X Chromosomes Provide a Means of 

Cloning the Gene for Duchenne Muscular 

Dystrophy 

cDNAs for the DMD Gene Were Cloned 

Using Two Strategies 
The Cystic Fibrosis Gene Was Cloned Using 

RFLP Analysis and Chromosome Jumping 
The Cystic Fibrosis Gene Was Identified by 

DNA Sequencing 
Clues about Protein Function Come from the 

Sequences of Cloned Disease Genes
Cloned Genes Are Used to Study Protein

Expression and Function 
Cloning Genes for Polygenic and Multifactorial 

Disorders Is Difficult 
Candidate Genes Can Be Used to Clone 

Human Disease Genes 
Cloning Human Disease Genes Will Continue 

to Depend on Linkage Studies 

27 DNA-BASED DIAGNOSIS OF

GENETIC DISEASES 

Biochemical Markers Used for Early Diagnosis 
Mutations in Globin Genes Cause the 

Thalassemias 
RFLPs and Linkage Analysis Are Used for 

Diagnosis 

Linkage Disequilibrium Can Be Used for 
Diagnosis 

Exon Deletions Are Used for Direct Diagnosis 
of DMD 

Gene Mutations Can Alter a Restriction 

Enzyme Site: Direct Diagnosis of 

Sickle-Cell Anemia 
Allele-Specific Oligonucleotide Probes Are

Used to Detect Mutations
The Ligase-Mediated Technique Detects

Mutations
The Polymerase Chain Reaction

Revolutionizes DNA-Based Diagnosis
DNA Diagnostic Methods Are Used to 

Distinguish Tumor Types 
Novel Methods Are Developed for Screening 

for Mutations 
Genetic Testing May Bring Problems As Well 

As Benefits 
Hypervariable or Variable Tandem Repeat 

Loci Can Be Used to Identify Individuals 
DNA Fingerprinting Is Used in the Courts
Genetic Privacy Will Become an Important 

Issue 






.N.N N 



Gene Defects Have Been Corrected in

Transgenic Animals 
Gene Therapy in Human Beings Raises Ethical 

Issues 

The Cystic Fibrosis Defect Can Be Corrected
in Vitro 

Hematopoietic Cells Are Used for Expression 

of Human Genes in Animals 
Genetically Engineered Bone Marrow Cells 

Survive for Long Periods in Vivo 
Skin Fibroblasts Are Target Cells for Gene 

Therapy 

Hepatocytes May Be Used for Gene Therapy
Gene Therapy Experiments Have Been

Conducted in Large Animals
Myoblast Transfer Is Used to Treat Duchenne 

Muscular Dystrophy 
Genes Can Be Delivered Directly to Target 

Sites in Vivo 
Genetically Modified Lymphocytes Have Been 

Administered to Human Beings 
Human Adenosine Deaminase Deficiency Is

Treated by Gene Therapy 



29 STUDYING WHOLE GENOMES 

Very Large Pieces of DNA Can Be Separated 
by Pulsed Field Gel Electrophoresis 
(PFGE) 

PFGE Is Used to Make Large-Scale Physical 
Maps 

Putting Together the Cloned Genome of E. coli
Requires Finding Overlapping Segments 

Yeast Artificial Chromosomes (YACs) Are Used 
for Cloning Huge DNA Fragments 

YACs Are Used to Link the Cosmid Contigs of
C. elegans

Cosmid and YAC Clones Are Ordered along
the Chromosomes of Drosophila Salivary
Gland Cells by in Situ Hybridization 






An Entire Yeast Chromosome Has Been Sequenced
A Multiplex Method Speeds Up DNA Sequencing
Automated DNA Sequencing Greatly Speeds Up the Process
Understanding of DNA Sequences Is Furthered by Homology Comparisons
Novel Methods Will Be Required for Large-Scale Sequencing of DNA



30 THE HUMAN GENOME INITIATIVE: FINDING ALL THE HUMAN GENES



Making a High-Resolution Genetic Map of Humans Uses Reference Markers
Human Chromosomes Are Separated from Each Other Using Cell Sorting Machines
DNA for Cloning Can Be Micro-dissected from Human Chromosomes
Somatic Cell Hybrids Serve as Sources for Purified Human Chromosome DNA
X-Irradiated Fragments of Human Chromosomes Are Used for Gene Mapping
Cloned Human DNA Fragments Must Be Assembled into Megabase Sized Contigs
Sequence-Tagged Sites Identify Cloned DNA
Complete Human Disease Genes Can Be Reassembled in YACs
YACs Are Used to Clone the Telomeres of Human Chromosomes
YACs Helped Unravel the Mysteries of the Fragile X Region
Large-Scale Sequencing of the Human HPRT Gene Was Done in Four Stages
Storing and Analyzing Genome Data Require Large Databases
Gene Mapping Can Be Facilitated by Comparing Species
Understanding Our Genome Will Benefit Humanity

Index






Application of recombinant DNA techniques to biology is bringing
about a revolution in our understanding of living organisms. There
is no field of experimental biology that is untouched by the power
we now have to isolate, analyze, and manipulate genes. When the
first edition of Recombinant DNA was published in 1983, recombinant
DNA techniques were already being used extensively for the analysis
of viral and bacterial genetics, but dissection of eukaryotic genes
was only just beginning. There were hints of what was to come. The
concept of the gene as a continuous stretch of DNA had been
shattered with the discovery of introns, but alternative splicing
and genes-within-genes were yet to be revealed. Identification of
cellular oncogenes seemed to promise an understanding of cancer,
but the mechanisms of their action and the existence of tumor
suppressor genes were still subjects for speculation. A handful of
genetic diseases were being analyzed at the molecular level, but
the isolation of the disease genes and the development of gene
therapy were yet to come.

Our aim in writing the second edition of Recombinant DNA is to show
how recombinant DNA techniques have led to the explosion in our
knowledge of fundamental biological processes. As in the first
edition, which was subtitled A Short Course, we provide a concise
presentation of the methods, underlying concepts, and far-reaching
applications of recombinant DNA technology. The field has grown
since the publication of the first edition, and so has our book.
But even though our previous subtitle may be inappropriate for this
enlarged edition, our approach to the material has remained true to
the spirit of the "short course": as before, the uninitiated will
find access to the field of recombinant DNA here.

The book is now divided into six major sections. The first five
chapters, which are largely unchanged from the first edition,
provide a historical introduction to the early development of
recombinant DNA technology, up to the point when studies of
eukaryotic organisms began in earnest. In the next section we
describe in detail the methods currently used to clone and analyze
genes, and devote an entire chapter to the polymerase chain
reaction, which has had an extraordinary impact on research. The
great power of recombinant DNA techniques comes from the ability to
explore gene functions by manipulating genes and then introducing
them back into cells. The third section of the book discusses how
this is done in mammalian cells, yeast, mice, and plants. The
fourth section describes the progress these manipulations have
allowed in key areas of biology. Here the range of recombinant DNA
applications is demonstrated, from the analysis of cell cycle
control and embryonic development, to the isolation of genes
involved with brain function. Indeed, these techniques have spawned
a whole industry: biotechnology. In the fifth section, we describe
some of its accomplishments, including the development of
genetically engineered pharmaceutical and agricultural products,
and the studies of the human immunodeficiency virus that are
leading the attack on AIDS. The differences between the first and
second editions are perhaps most evident in the final section,
where we describe the revolution in human molecular genetics and
the ways in which recombinant DNA techniques are providing new
methods for diagnosis and treatment of human inherited diseases.

The topics that are covered and the approach we take to describing
them make this book suitable for undergraduate and graduate
students in molecular biology, cell biology, biochemistry,
genetics, or biotechnology courses; for medical students and
physicians; and for others who have an interest in recombinant DNA
techniques, for example, forensic scientists, patent attorneys, and
science journalists.






Textbooks dealing with biochemistry, molecular genetics, and
molecular biology usually present information without describing
the experiments that were done to obtain it. We think that this is
a pity, because designing and doing experiments is exciting and
fun. As in the first edition, we have used real experiments to
illustrate important biological phenomena, and we have plundered
our colleagues' papers for interesting examples. Figures are used
profusely to try to make complex real-life experiments
intelligible, but inevitably we have not been able to present all
the subtle details. Those who want to explore these details will
find the experiments in the research papers listed at the end of
each chapter, and the review papers we cite will provide an entry
point to each topic.



This book is atypical in another regard. Because we do not consider
it primarily a textbook for conveying undisputed facts about
molecular biology, we have been able to include exciting research
that is at the cutting edge of biology. The interpretation of
experimental data often changes with time, so the reader should
bear in mind that future research might require modification of
some of the ideas we present. This is all part and parcel of doing
research, because a science that does not change is a dead science.
Modern experimental research in biology is an ever-changing dynamic
enterprise, and we hope that Recombinant DNA conveys the excitement
of the continuing process of discovering how organisms work.



Acknowledgments 



Those who most deserve our thanks are our families, from whom we
were taken for many days by this book, and our colleagues at Cold
Spring Harbor Laboratory and Genentech, who sometimes had a
difficult time communicating with us when we were preoccupied with
writing.



We are grateful to the many friends and colleagues who read our
manuscript, criticized, corrected, and provided information. They
include Sue Alpert, French Anderson, Avi Ashkenazi, David Beach,
Martin Bobrow, Tom Caskey, Jeff Chamberlain, Irvin Chen, Francis
Collins, Alan Coulson, David Cox, Ken Culver, Kay Davies, Jim
Eberwine, Stan Fields, Ted Friedman, Bruce Futcher, Peter Gergen,
Richard Gibbs, Paul Godowski, Andre Goffeau, Takashi Gojobori,
Kenshi Hayashi, Dan Hartl, Andrew Hake, Tom Hynes, Paula Jardieu,
Karen Johnson, Dan Klessig, Jeff Kuret, Mike Laspia, Philip Leder,
Fred Ledley, Vincent Marchesi, Rob Martienssen, Dusty Miller, Rick
Myers, Karoly Nikolics, Luis Parada, Scott Putney, Don Rio, David
Schlessinger, Matt Scott, John Sulston, Barbara Trask, Rebecca
Ward, Robin Weiss, Jim Wells, Ted White, and Bob Williamson. Any
errors, nevertheless, are ours, and not theirs.

Elizabeth Zayatz, the development editor, had the unenviable job of
trying to make us concentrate on the work at hand when we wanted to
be doing other things. She did wonderfully well keeping us at it
and became a trusted friend. Bill O'Neal and Jodi Simpson, the
manuscript editors, smoothed awkward passages and made us think
more carefully about the information we were trying to convey.
Janet Tannenbaum, the project editor, combed the manuscript and
graphics with a thoroughness that must have required a magnifying
glass. She guided us through a forest of edited manuscript,
galleys, and page proofs. At times it seemed that never a day went
by without an overnight package from Janet arriving on our desks.
Alison Lew gave the text, illustrations, and cover their design,
and Bill Page looked after the art program, tasks that rapidly
assumed epic proportions as we all worked to produce the figures
that are an essential complement to the text. The figures were
rendered by Network Graphics and Tomo Narashima. The beautiful and
remarkable cover art is by Marvin Mattelson. Listening to him
expound on surrealism and watching him sketch were highlights of
the production. Julia De Rosa masterfully coordinated the
production process to allow rapid completion of the project. Linda
Chaput, President of Scientific American Books, was always patient
and understanding. Her guidance and advice were invaluable when the
going got tough, and our discussions with her were rewarding and
fun.



James D. Watson Michael Gilman 
Jan Witkowski Mark Zoller 

December, 1991 



CHAPTER 11

In Vitro Mutagenesis




Recombinant DNA technology and DNA sequencing provided the tools
to clone and characterize genes. As we learned in Chapter 8, simple
inspection of gene sequences told us much about genomic
organization. Functional sequences, such as transcriptional control
elements, could often be identified by comparing sequences of a
number of genes. However, to delve deeply into the structure and
function of genes required the ability to change the DNA sequence
and examine the effect of the change on gene function. For decades
before the advent of recombinant DNA, this was done by classical
genetics, the identification of mutant organisms with new
properties. From the genetic properties of mutants, information
about the structure and function of the underlying genes could
often be inferred. This approach, however, was limited to organisms
in which simple genetic analysis was possible: bacteria, yeast,
fruit flies. Genetic analysis of more complex, longer-lived
organisms like mice and men was slow and difficult.

Recombinant DNA changed all that. The ability to isolate genes as
molecular clones, the development of tools to modify gene sequences
in the test tube, and the power to return altered genes to the
organism to test their function have revolutionized the way
genetics is done in higher organisms. Because we now often work
"backwards" from gene sequence to gene function, in contrast to





FIGURE 11-1

General strategy for an in vitro mutagenesis experiment. Most
procedures for in vitro mutagenesis follow the same basic scheme:
Plasmid DNA is "mutagenized" in vitro, then introduced into E. coli
by transformation. Depending on the method, mutant clones can be
isolated and tested individually, or a library of mutant plasmids
can be obtained, which are tested using a genetic screen.



classical genetics, this new approach spawned by recombinant DNA is
called reverse genetics. In this chapter we will learn ways to
alter the sequence of a cloned gene at will and how these methods
are used to understand the structure and function of genes and gene
products.

In Vitro Mutagenesis Is Used
to Study Gene Function

In vitro mutagenesis of cloned genes has become a standard tool in
the functional analysis of nucleic acids and proteins. Most
procedures follow the same basic scheme (Figure 11-1). Plasmid DNA
containing the gene of interest is treated in vitro by some
mutagenesis procedure that alters the DNA either chemically or
enzymatically. The mutagenized plasmid DNA is introduced into
E. coli by transformation, and colonies containing plasmid
molecules are selected by antibiotic resistance. Mutants can be
made one at a time, or hundreds of different mutants can be created
in a single mutagenesis experiment. Mutant plasmids can be isolated
from single colonies and tested individually. Alternatively,
plasmid DNA can be prepared from pooled colonies and the resulting
library tested en masse to identify mutant plasmids.

The various approaches to mutagenesis can be grouped broadly into
random and site-directed methods. Random methods put mutations
anywhere in a plasmid. They are best used to identify the location
and boundaries of a particular function within a cloned DNA
fragment and are most readily used for this purpose when a simple
genetic screen (or selection) is available. A genetic screen or
selection consists of a system to test the function of the DNA of
interest in cells without having to isolate each plasmid
individually. Random mutagenesis is often used as a first step,
when little is known about the function encoded by a particular DNA
fragment. Analysis of random mutants generally provides only a
simple identification of the functional region but does not explain
how things work on a molecular level. The value of such a strategy
is that it quickly helps to narrow down the focus of attention from
a large DNA fragment to a smaller region that can be studied
subsequently in greater detail. As we will learn, random
mutagenesis can be accomplished by several different methods, such
as altering the sequences within restriction endonuclease sites,
inserting an oligonucleotide linker randomly into a plasmid,
damaging plasmid DNA in vitro with chemicals, or incorporating
incorrect nucleotides during in vitro DNA synthesis.

Once an important functional domain in a gene has been identified
by random mutagenesis, site-directed methods, which put mutations
precisely where they are needed, are used to define the role of
specific sequences. In addition, directed mutagenesis provides a
powerful tool for the analysis of protein function, by allowing
researchers to make specific and subtle changes in the structure of
the protein. A number of

strategies have been developed to construct site-directed mutants,
and most can be accomplished with synthetic oligonucleotides. With
an oligonucleotide the desired sequence is simply built into the
wild-type framework. Nowadays, oligonucleotide synthesis reactions
are

relatively straightforward, and oligonucleotides are cheap and easy
to obtain. The limitation of site-directed mutagenesis is that you
must already have enough information to know what you wish to
change. There are two standard ways of using oligonucleotides to
construct site-directed mutants: mutagenesis by gene synthesis and
mutagenesis by enzymatic extension of a mutagenic oligonucleotide.
By using degenerate oligonucleotides (see Chapter 7) a set of
"random" mutations at a specific site can also be made.

Restriction Endonuclease Sites Provide
the Simplest Access for Mutagenesis

One of the first experiments done with a cloned DNA fragment is to
map the positions of restriction endonuclease cleavage sites in the
DNA by using a battery of different enzymes. Although this
information could be precisely obtained from the DNA sequence,
mapping restriction sites can be accomplished rapidly and is often
done in conjunction with sequencing. Restriction endonuclease
recognition sites provide the simplest way to modify a DNA clone in
vitro (Figure 11-2). Cleaving plasmid DNA with a restriction enzyme
that recognizes only one site produces a linear molecule. This
serves as an entry point for modifying the DNA sequence in the
vicinity of the restriction site. For example, the enzyme EcoRI
recognizes the sequence GAATTC and produces ends with 5' overhangs.
The ends can be made even (blunt) by treating



FIGURE 11-2

Creating a mutation by manipulation of a restriction site. Plasmid
DNA is cleaved with EcoRI restriction endonuclease, which generates
a linear fragment with 5' ends that have four unpaired nucleotides
(so-called sticky ends). Treatment with S1 nuclease (left) removes
these nucleotides, and the linear fragment is then treated with DNA
ligase. The resulting circular molecule contains a deletion of
4 bp. Alternatively, addition of DNA polymerase and
deoxyribonucleotide triphosphates (dNTPs) to the plasmid cleaved
with EcoRI extends the 3' ends by DNA synthesis (right). After
ligation, the resulting molecule contains an insertion of 4 bp. In
both cases, the EcoRI site has been destroyed.






FIGURE 11-3

Linker insertion mutagenesis to map functional domains of a
bacterial transposable element. The starting plasmid contains an
intact transposon, an ampicillin-resistance gene for selection in
E. coli, and sequences for plasmid replication. The DNA is treated
with a low concentration of deoxyribonuclease I in the presence of
Mn2+. Under these conditions, the enzyme makes double-stranded cuts
at random positions in the plasmid, generating a collection of
linear DNA molecules broken at different positions. Oligonucleotide
linkers encoding an EcoRI restriction site are added to the ends
with DNA ligase, the linear molecules are treated with EcoRI
endonuclease to create sticky ends on the linkers, and the
molecules are recircularized. The circular molecules are
transformed into E. coli, and ampicillin-resistant colonies are
selected. Plasmid DNA is isolated from individual colonies,
introduced into another strain of E. coli, and tested for activity
of the transposon. The positions of the inserted linkers are mapped
by restriction digestion. Linkers inserted in one region (blue) of
the plasmid inactivated the transposon. No linker insertions in the
ampicillin-resistance gene were recovered, because these plasmids
would fail to yield a drug-resistant colony in the original
selection of transformed E. coli.



the cleaved DNA with DNA polymerase in the presence of
deoxyribonucleotide triphosphates. The two blunt ends can then be
linked together again (ligated) by incubating the linear plasmid
molecule with DNA ligase. A few nanograms of DNA from the in vitro
ligation reaction is used to transform E. coli, and the new
modified plasmid is isolated from one of the resulting colonies.
The net result of these manipulations is to insert 4 bp into the
plasmid at the EcoRI site. Alternatively, a small deletion mutation
can be made by treating the linearized DNA with S1 nuclease, which
specifically digests single-stranded DNA. This creates blunt ends
by removal of the four nucleotides that constitute the 5' overhang
generated by EcoRI at each end. Subsequent ligation of the DNA into
a covalently closed circular molecule thus results in the deletion
of 4 bp from the DNA. In each example, the new sequence no longer
encodes the EcoRI recognition site. These types of manipulations,
if done to a protein coding sequence, would change the
translational reading frame, resulting in production of a grossly
altered protein. The major limitation of using restriction sites to
make mutations is that there simply may not be sites in regions of
the gene the experimenter wishes to alter.
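
The fill-in and S1 nuclease manipulations just described can be followed on paper with a short Python sketch. It is an illustration only, not a laboratory protocol: the plasmid is modeled as a plain string holding the top strand around a single EcoRI site, and the function names and the example sequence used to exercise them are hypothetical.

```python
ECORI = "GAATTC"  # EcoRI cuts after the first G (G^AATTC), leaving 5'-AATT overhangs


def cut_once(seq: str) -> tuple[str, str]:
    """Split the top strand at the single EcoRI site.

    Returns (left, right): left ends in ...G, right begins with the
    top-strand AATT overhang.
    """
    if seq.count(ECORI) != 1:
        raise ValueError("expected exactly one EcoRI site")
    cut = seq.index(ECORI) + 1
    return seq[:cut], seq[cut:]


def fill_in_and_ligate(seq: str) -> str:
    """DNA polymerase + dNTPs fills both overhangs; blunt ligation inserts 4 bp."""
    left, right = cut_once(seq)
    # The left end gains AATT copied from the bottom-strand overhang;
    # the right end already carries AATT on the top strand.
    return left + "AATT" + right


def s1_and_ligate(seq: str) -> str:
    """S1 nuclease removes both single-stranded overhangs; ligation deletes 4 bp."""
    left, right = cut_once(seq)
    return left + right[4:]
```

Applied to a sequence with one EcoRI site, the first function yields GAATTAATTC in place of GAATTC and the second yields GC. Both products lack the EcoRI site, and because the change in length is 4 bp (not a multiple of 3), either change inside a coding sequence shifts the reading frame.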






Linker Insertion Is Used to Map a
Bacterial Transposon

We have learned that it is a simple matter to cleave a plasmid with
a restriction enzyme, blunt the ends by treatment with a DNA
polymerase, and rejoin them by ligation. A variation on this
technique is to rejoin the ends in the presence of a synthetic
oligonucleotide "linker," often one that encodes a restriction
site. Insertion of the linker disrupts the gene sequence; the
position of the inserted linker can be easily mapped by cleavage of
the plasmid with the restriction enzyme that cuts the linker.

A similar method was used to define the functional regions of a
bacterial transposable element (a "jumping gene," see Chapter 10),
by inserting linkers at many alternative positions throughout the
element. To place linker insertions in the transposon, a plasmid
carrying a clone of the transposon was treated with a nuclease that
cleaved the plasmids at random positions (Figure 11-3). Cleavage
conditions were adjusted so that each plasmid was cut just once on
average. The linearized molecules were isolated and ligated into
circles again in the presence of an 8-bp linker oligonucleotide
containing an EcoRI restriction site, resulting in insertion of the
linkers into random sites, one in each plasmid. The resulting
plasmids were transformed into E. coli and, using a genetic screen,
examined to see if the transposon could jump. Insertion of a linker
into a region of the transposon critical for its function
inactivates it, presumably by putting a protein-coding sequence out
of frame. By mapping the positions of the inserted linkers by
restriction analysis, the locations of functional regions of the
transposon were deduced.
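
The logic of the linker insertion experiment can be sketched in Python. This is a simplified model under stated assumptions: the plasmid is a string with no EcoRI site of its own, the 8-bp linker sequence GGAATTCC is an illustrative choice, and the function names and the (position, active) result format are hypothetical.

```python
ECORI = "GAATTC"
LINKER = "GGAATTCC"  # illustrative 8-bp linker carrying one EcoRI site


def insert_linker(plasmid: str, pos: int) -> str:
    """Insert the linker at one position, as after a single random cut and ligation."""
    if ECORI in plasmid:
        raise ValueError("model assumes the starting plasmid has no EcoRI site")
    return plasmid[:pos] + LINKER + plasmid[pos:]


def map_linker(mutant: str) -> int:
    """Recover the insertion position from the location of the new EcoRI site."""
    # The site begins one base into the linker, hence the -1.
    return mutant.index(ECORI) - 1


def inactivating_region(results):
    """Given (position, transposon_active) pairs, return the span of inactivating insertions."""
    inactive = sorted(pos for pos, active in results if not active)
    if not inactive:
        return None
    return inactive[0], inactive[-1]
```

Because the linker is 8 bp long, an insertion within a coding sequence shifts the reading frame, which is consistent with the inactivation described above. The span returned by `inactivating_region` is only as precise as the density of insertions that were recovered and tested.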



Construction of Nested Deletions Maps 
the Boundaries of a Transcriptional 
Control Region 

Transcription of the gene encoding the 5S ribosomal RNA molecule is
carried out by RNA polymerase III (pol III, see Chapter 8). To
identify the sequences within the 5S gene required for
transcription by pol III, a series of deletion mutations was made
and tested for their ability to support accurate transcription. Two
sets of deletions were made. One was made by cutting a plasmid
carrying a cloned 5S gene at a restriction site on the 5' side of
the gene. The linearized plasmid was treated with a combination of
nucleases that digested away DNA from the ends of the molecule
(Figure 11-4). The amount of DNA removed was controlled by varying
the time, temperature, or enzyme concentration in the reaction. A
second set of deletions was generated from plasmid DNA cleaved at a
site on the 3' side of the gene. The result was two sets of
plasmids with progressively larger deletions toward the gene from
both directions. Testing these genes revealed that only deletions
entering a 35-bp region within the transcribed region of the 5S
gene abolished transcription by pol III. Therefore, this deletion
analysis mapped the transcriptional regulatory element to this
35-bp stretch, which has subsequently been analyzed in much greater
detail by site-directed mutagenesis.

Several different types of enzymes can be used to produce
deletions. Generally, these enzymes delete DNA from both ends of a
linearized plasmid molecule. Often, however, one end of the
molecule contains sequences that need to be retained in the plasmid
because, for example, they are required for plasmid replication. In
the 5S gene deletion experiment, this limitation was accommodated
by isolating the deleted gene fragments and recloning them into a
new vector. Alternatively, a strategy can be used that limits
deletion to one end of a linearized plasmid molecule (Figure 11-5).
This method is widely used to generate nested deletions for DNA
sequencing (see Chapter 7).
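
The reasoning used to read boundaries off two nested deletion series can be written as a small Python function. The sketch rests on assumptions made for illustration: each mutant is summarized as an (end point, active) pair, a 5' deletion removes everything upstream of its end point, a 3' deletion removes everything downstream of its end point, and the function name is hypothetical.

```python
def map_control_region(five_prime, three_prime):
    """Estimate control-region boundaries from two nested deletion series.

    five_prime:  (end_point, active) pairs; sequence upstream of end_point is deleted.
    three_prime: (end_point, active) pairs; sequence downstream of end_point is deleted.
    Returns (left, right): the deepest 5' deletion and the deepest 3' deletion
    that still support transcription.
    """
    active_5 = [end for end, active in five_prime if active]
    active_3 = [end for end, active in three_prime if active]
    if not active_5 or not active_3:
        raise ValueError("each series needs at least one active mutant")
    # Everything outside these end points was deleted without loss of
    # activity, so the essential sequences must lie between them.
    return max(active_5), min(active_3)
```

The estimate is bounded by the spacing of the available end points: the true boundary lies somewhere between the deepest active deletion and the shallowest inactive one in each series.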



FIGURE 11-4

Construction of a nested set of deletion mutants to map the
transcription control region of a 5S ribosomal RNA gene. (a) A
plasmid clone was linearized with a restriction enzyme at a
position (A) on the 5' side of the gene. The linear fragments were
treated with an exonuclease, which digests DNA from both ends of
the molecule. Portions of the reaction were removed at different
times to recover populations of molecules with progressively larger
deletions. Linkers were added to the ends, and the molecules were
cleaved with restriction enzymes specific for sites B and C to
separate the 5S gene fragments from the remnants of the vector. The
fragments were recloned into a new vector, generating the set of
rightward deletion mutants. To create the leftward deletion
mutants, this process was repeated after cleaving the plasmid at
restriction site B. (b) Individual plasmids were isolated after
transformation, their deletion end points determined by DNA
sequencing, and their ability to support transcription by RNA
polymerase III tested with an in vitro assay. As can be seen by
comparing transcription activity with the extent of deletion,
transcription is inhibited when the rightward (5') deletions enter
the +40 region and when the leftward (3') deletions pass the +80
point. This suggests that the transcription control region lies
between +40 and +80.

Linker-Scanning Mutagenesis Permits
Systematic Analysis of Promoters

Deletion mutagenesis of the 5S gene mapped the boundaries of the
transcriptional control region in the gene. But not all the
nucleotides within the boundaries of that 35-bp region are
necessarily critical for function. Therefore, methods were needed
to change individual nucleotides in a target without generating
gross deletions or other rearrangements. This was accomplished for
a viral promoter using an elegant adaptation of deletion
mutagenesis called linker scanning. Using the methods outlined in
Figure 11-4, two sets of plasmids were constructed that contained
deletions within the promoter. One set of deletions started from a
site beyond the 5' end and proceeded toward the gene, leaving the
3' end intact; the other set started at a point within the gene and
proceeded in the opposite direction, leaving the 5' end intact.
Each deletion terminated with a 10-bp BamHI linker. The extent of
the deletion in the DNA was determined for each plasmid by DNA
sequencing. Pairs of plasmids from the two deletion sets with end
points 10 bp apart were recombined at their BamHI sites
(Figure 11-6). The effect was to preserve the length and
organization of the promoter (thought to be important for promoter
function) but to replace various 10-bp segments of wild-type
promoter sequence with the sequence in the linker. Thus, this
experiment created a library of promoter mutants of similar
structure but with nucleotide substitutions clustered within 10-bp
windows located at various sites in the promoter. This collection
of mutants spanned the length of the promoter. The results of this
analysis were discussed in Chapter 9. At the time, this experiment
represented the most thorough analysis of a promoter in a mammalian
gene.
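
The way a matched pair of deletions produces a linker-scanning mutant can be shown with a short Python sketch. It is a simplified model, and several things in it are assumptions made for illustration: the promoter is a plain string, the 10-bp linker sequence CCGGATCCGG is an illustrative BamHI linker, and the function names are hypothetical.

```python
LINKER = "CCGGATCCGG"  # illustrative 10-bp linker with a central BamHI site (GGATCC)


def linker_scan(wild_type: str, left_end: int, right_start: int,
                linker: str = LINKER) -> str:
    """Join the 5' part of one deletion mutant to the 3' part of another.

    left_end:    wild-type sequence retained up to (not including) this index.
    right_start: wild-type sequence retained from this index onward.
    The two end points must be exactly one linker length apart so that the
    overall length of the promoter is preserved.
    """
    if right_start - left_end != len(linker):
        raise ValueError("deletion end points must be one linker length apart")
    return wild_type[:left_end] + linker + wild_type[right_start:]


def substitutions(wild_type: str, mutant: str) -> list[int]:
    """List the positions at which the mutant differs from the wild type."""
    return [i for i, (a, b) in enumerate(zip(wild_type, mutant)) if a != b]
```

A mutant built this way has the same length as the wild-type promoter, and every substitution falls inside the 10-bp window. The number of substitutions can be fewer than ten, because some linker bases happen to match the wild-type sequence they replace.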



FIGURE 11-5

Construction of unidirectional deletions using exonuclease III.
Exonuclease III attacks preferentially the 3' end of a linear DNA
molecule with 5' protruding nucleotides. Therefore, by cleaving a
plasmid molecule at adjacent sites with BamHI, which leaves a 5'
overhang, and PstI, which leaves a 3' overhang, only the end
generated by BamHI is attacked by exonuclease III. After
exonuclease III treatment, the remaining single-stranded tail
(along with the overhang at the other end) is removed with S1
nuclease, which digests only single-stranded DNA. An
oligonucleotide linker is attached, and the fragments are ligated
to form closed circular molecules. In the experiment shown here,
deletions are being used to map the functional domains of a cloned
gene inserted in an expression vector. This strategy allows
deletions to be made only in the cloned gene, without damaging the
promoter sequence.



Random Nucleotide Substitutions Are 
Obtained by Chemical Modification of 
DNA or by Enzymatic Misincorporation 

While linker scanning allows the creation of nucleotide
substitutions, each mutant generally contains several
substitutions, and the positions of the mutations depend on the
availability of appropriately placed deletions. Therefore, several
strategies have been developed for placing single nucleotide
substitutions at random positions in a DNA molecule. The simplest
methods employ chemicals that modify or damage DNA. Generally,
plasmid DNA or DNA fragments are treated with chemicals,
transformed into E. coli, and propagated as a library of mutant
plasmids. Chemicals most commonly used for in vitro mutagenesis
include sodium bisulfite, which deaminates cytosine residues to
uracil, and reagents that damage or remove



FIGURE 11-6

Linker-scanning mutagenesis of a viral promoter for the thymidine
kinase (TK) gene. Two [...] the extent of deletion [...]
approximately one hundred plasmids were sequenced [...] deletion of
the other [...] except for the substitution of the 10-bp BamHI
linker in place of the wild-type sequence [...]. In the example
shown, [...] results in a cluster of eight nucleotide substitutions
(arrows).






FIGURE 11-7

Chemical mutagenesis using sodium bisulfite. Sodium bisulfite
reacts with cytosine bases of single-stranded DNA to convert them
to uracil, a thymine analog that base-pairs with adenine.
Single-stranded DNA is treated with sodium bisulfite to modify a
small number of cytosine residues in each molecule. An
oligonucleotide primer is annealed to the DNA and serves as a
primer for synthesis by DNA polymerase. When the polymerase
encounters a uracil in the template strand, it incorporates an
adenine into the newly synthesized DNA. Since the vector sequences
are also damaged by bisulfite treatment, it is necessary to excise
the double-stranded DNA fragment by restriction endonuclease
cleavage and reclone it into an undamaged vector. Following
transformation into E. coli, a library of mutant plasmids can be
isolated or individual plasmids can be purified and tested. The
average number of substitutions in the DNA fragment can be
controlled by altering the conditions of bisulfite treatment.



bases, thereby preventing normal Watson-Crick base-pairing
(these include hydrazine and formic acid, which are used in
Gilbert-Maxam DNA sequencing, Chapter 5). Most often, chemical
mutagenesis is performed on single-stranded DNA and followed by
in vitro synthesis of the complementary strand using a DNA
polymerase (Figure 11-7). This synthesis incorporates the
mutation into the new strand. In DNA treated with bisulfite, an
adenine nucleotide is incorporated opposite the uracil; after
transformation into







FIGURE 11-8

Oligonucleotide-directed mutagenesis by enzymatic primer
extension. A "mutagenic" oligonucleotide encoding the desired
mutation embedded in wild-type flanking sequence is annealed to
a single-stranded DNA template. The sequence of the
oligonucleotide is complementary to the template except for the
nucleotides that define the mutation. Generally, the mutagenic
oligomer is designed so that the mismatched nucleotides are
positioned in the middle and there are at least 8 to 12
nucleotides on either side that base-pair with the template
DNA. The mutagenic oligonucleotide serves as a primer for DNA
synthesis by DNA polymerase. Once the entire template has been
copied, the ends of the newly synthesized strand are covalently
linked by DNA ligase. The heteroduplex DNA is transformed into
E. coli. Theoretically, both strands can replicate, segregating
into separate mutant and wild-type plasmids. In practice,
however, most colonies contain only one or the other, because
enzymes in the cell recognize and repair mismatched nucleotides
in the heteroduplex before replication. Plasmid DNA is isolated
from the resulting colonies and is screened to identify mutants.






E. coli, the wild-type C-G base pair becomes a T-A pair. In DNA
treated with reagents that eliminate bases, any nucleotide can
be incorporated opposite the "abasic" site, which still retains
its deoxyribose backbone although it has lost its base. The
major limitation of chemical mutagenesis is the specificity of
the individual reagents: bisulfite mutagenesis, for example,
changes only cytosines.
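
The bisulfite route from a C-G pair to a T-A pair can be traced in a few lines of Python. This is a minimal sketch, not a laboratory protocol: the sequence, the per-cytosine conversion rate, and all function names are assumptions chosen for illustration.

```python
import random

# Base-pairing partners; polymerase treats U like T and inserts A opposite it.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G", "U": "A"}

def bisulfite_treat(strand, rate, rng):
    """Deaminate each C of a single strand to U with probability `rate`."""
    return "".join(
        "U" if base == "C" and rng.random() < rate else base
        for base in strand
    )

def copy_strand(template):
    """Return the complementary strand, written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(template))

def after_replication(treated):
    """Sequence of the treated strand once E. coli has replicated it (U -> T)."""
    return treated.replace("U", "T")

rng = random.Random(0)                  # fixed seed so the run is repeatable
wild_type = "GATCCGTACCGA"              # hypothetical target fragment
treated = bisulfite_treat(wild_type, rate=0.3, rng=rng)
new_strand = copy_strand(treated)       # carries A opposite every U
mutant = after_replication(treated)     # every converted C now reads as T

print(wild_type, treated, new_strand, mutant, sep="\n")
```

Whatever the random draw, the only changes that can appear are C to T on the treated strand, which is the specificity limitation noted above.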

All possible nucleotide substitutions can be generated using
enzymatic misincorporation. Here the strategy is to perform in
vitro DNA synthesis under nonideal conditions (suboptimal ionic
conditions, unbalanced concentrations of nucleotide precursors)
that encourage DNA polymerase occasionally to incorporate the
wrong nucleotide during synthesis. For example, synthesis is
carried out in the presence of high concentrations of three of
the precursors and a very low concentration of the fourth. At
positions that normally call for the fourth (scarce)
nucleotide, one of the others is sometimes incorporated
instead. These methods also exploit DNA polymerases that lack a
proofreading activity, a 3' to 5' exonuclease mechanism that
checks each base pair after incorporation and removes
nucleotides that are mismatched. Thermus aquaticus (Taq) DNA
polymerase, used in the polymerase chain reaction (Chapter 6),
lacks such an activity. Though this is a problem when accuracy
of synthesis is required, the PCR is a very simple and
efficient way to introduce random nucleotide substitutions into
a DNA fragment.

A general problem with random mutagenesis approaches is that
they often produce mutants with more than one substitution.
Multiple substitutions in a single mutant complicate the
interpretation of an experiment, because it isn't clear which
substitution (or which combination of substitutions) is
responsible for



In Vitro Mutagenesis 






observed changes in the properties of the mutant. Extraordinary
methods have been used to circumvent this problem (essentially,
significantly reducing the extent of mutagenesis and using
enrichment protocols to find rare mutants), but almost all
these procedures have been supplanted by new methods that use
synthetic oligonucleotides.



Synthetic Oligonucleotides
Facilitate Mutagenesis

Most of the methods for mutagenesis we have discussed so far
have some significant shortcoming: they rely on fortuitous
access to a sequence via a restriction site, forced entry
through deletion strategies, or tedious screens to find
randomly generated mutations in the region of interest. To be
most powerful, mutagenesis must allow the experimenter to place
any modification at any position desired in cloned DNA. This
has become not only possible, but simple and cheap, with the
advent of synthetic DNA oligonucleotides. Oligonucleotides
provide the means to design a particular mutation and then to
place it precisely where you want it.

The simplest method for doing oligonucleotide-directed
mutagenesis is by enzymatic primer extension (Figure 11-8). In
this method, an oligonucleotide is designed that carries the
mutation flanked by 10 to 15 nucleotides of wild-type sequence.
This "mutagenic" oligonucleotide is hybridized to its
complementary sequence in single-stranded wild-type DNA
prepared from a phage or phagemid clone, forming a heteroduplex
with mismatched nucleotides at the site of the mutation.
Although the oligonucleotide is not perfectly complementary, it
will anneal if the hybridization conditions are not very
stringent. The oligonucleotide serves as a primer for in vitro
enzymatic DNA synthesis by a DNA polymerase that converts the
single-stranded DNA into double-stranded form, using the
wild-type strand as template. In this way, all regions of the
plasmid except the region containing the mutagenic
oligonucleotide will be wild-type in sequence. Once the primer
has been extended completely around the template, the ends of
the newly synthesized strand are ligated, forming a
double-stranded circular DNA molecule. This heteroduplex DNA
(one strand has the wild-type sequence and the other strand has
the mutant sequence) is transformed into E. coli, where either
strand can be replicated. By the time a colony grows up,
however, it usually contains only one type of plasmid,
wild-type or mutant. The types of mutations that can be made by
this approach range from single nucleotide substitutions to
deletions or insertions, limited only by the size of the
oligonucleotide needed.
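
The design rule for a mutagenic oligonucleotide (one changed base flanked by 10 to 15 matching nucleotides on each side) can be sketched as follows. The template sequence, the function names, and the default flank of 12 are illustrative assumptions. Here `new_base` is the base wanted on the template-sense strand, so the oligonucleotide, which pairs with the template, carries its complement.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def revcomp(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

def design_mutagenic_oligo(template, position, new_base, flank=12):
    """Return (oligo, start) for a single-base change at `position`.

    The oligo is complementary to template[start:start + 2*flank + 1]
    except at the central position, where it pairs with `new_base`.
    """
    start, end = position - flank, position + flank + 1
    if start < 0 or end > len(template):
        raise ValueError("not enough flanking sequence on the template")
    if new_base == template[position]:
        raise ValueError("new_base must differ from the wild-type base")
    mutant_window = (template[start:position] + new_base
                     + template[position + 1:end])
    return revcomp(mutant_window), start

def mismatch_positions(oligo, template, start):
    """Positions (within the annealed window) where oligo and template do not pair."""
    paired = revcomp(oligo)
    window = template[start:start + len(oligo)]
    return [i for i, (a, b) in enumerate(zip(paired, window)) if a != b]

# Hypothetical 40-nucleotide stretch of single-stranded template.
template = "ATGGCTAGCTAGGATCCGATCGATTACGCTAGCTAGGCTA"
oligo, start = design_mutagenic_oligo(template, position=20, new_base="A")
print(oligo, mismatch_positions(oligo, template, start))
```

The single mismatch falls in the middle of the oligonucleotide, with perfectly paired flanks on both sides to hold the primer on the template.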



Mutant Clones Can Be Identified by 
Hybridization and DNA Sequencing 

Theoretically, half the daughter molecules of a mutagenesis
reaction will be wild-type and half mutant. In practice,
however, the percentage of mutant plasmids is often much lower.
This is due to a variety of technical factors, but the
consequence is that methods for identifying or enriching mutant
clones are vital. Mutant molecules can be distinguished from
wild-type if there is gain or loss of a restriction site.
Alternatively, the oligonucleotide that was originally used to
make the mutation can be used as a hybridization probe to
distinguish mutant from wild-type molecules (Figure 11-9). The
mutagenic oligonucleotide is radioactively labeled with
32P-ATP and hybridized to DNA from bacterial colonies on
nitrocellulose filters, as described in Chapter 7. If the
temperature of the hybridization is raised in 5 or 10°C
increments, a point can usually be reached at which the labeled
oligonucleotide will hybridize only to the mutant molecules (to
which it is perfectly complementary) and not to the wild-type
molecules, because the hybrid is destabilized by the mismatched
nucleotides. Plasmid DNA is isolated from an E. coli colony
that strongly hybridizes to the probe. Verification that the
desired mutation was made is accomplished by sequencing the DNA
of this putative mutant clone. This technique can identify one
mutant clone among several hundred wild-type clones.
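
The logic of the temperature-stepped screen can be illustrated with a rough calculation. The sketch below uses the common rule of thumb for short oligonucleotides (about 2°C per A or T and 4°C per G or C) to estimate where a perfectly matched probe melts. That rule is an approximation brought in for illustration and is not taken from this chapter. It says nothing precise about a mismatched duplex, which is why the discriminating temperature is found empirically by stepping upward. The sequences are hypothetical and, for simplicity, probe and target sites are written in the same sense.

```python
def wallace_tm(oligo):
    """Rule-of-thumb melting temperature (deg C) of a short, fully paired oligo."""
    at = oligo.count("A") + oligo.count("T")
    gc = oligo.count("G") + oligo.count("C")
    return 2 * at + 4 * gc

def mismatches(probe, site):
    """Number of positions at which the probe fails to match a target site."""
    if len(probe) != len(site):
        raise ValueError("probe and site must have the same length")
    return sum(a != b for a, b in zip(probe, site))

def wash_temperatures(start_c, stop_c, step_c=5):
    """Temperatures to try, stepping upward as described in the text."""
    return list(range(start_c, stop_c + 1, step_c))

probe = "ACGTTGCATGACTAGGTCA"            # mutagenic oligonucleotide (hypothetical)
mutant_site = probe                       # perfect match in mutant clones
wild_type_site = "ACGTTGCATGCCTAGGTCA"    # one base differs in wild-type clones

print("mismatches vs mutant:   ", mismatches(probe, mutant_site))
print("mismatches vs wild type:", mismatches(probe, wild_type_site))
print("estimated Tm of perfect duplex:", wallace_tm(probe))
print("wash steps:", wash_temperatures(25, wallace_tm(probe)))
```

The useful wash temperature lies somewhere below the estimate for the perfect duplex: high enough to release the probe from wild-type DNA, low enough to keep it on the mutant.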

Several clever methods enrich for mutant clones so that the
tedious task of screening by hybridization is not necessary. In
one of these techniques, the template DNA is biologically
marked so that it is destroyed after transformation into
E. coli and the mutant strand






FIGURE 11-9

Searching for mutant plasmids using the mutagenic
oligonucleotide as a probe. Colonies (or plaques) resulting
from transformation by mutagenized plasmids (see Figure 11-8)
are prepared for colony hybridization on nitrocellulose filters
using methods described in Chapter 7. The mutagenic
oligonucleotide is radioactively labeled by phosphorylating its
5' end using 32P-ATP and polynucleotide kinase. The labeled
oligonucleotide is hybridized to the plasmid DNA on the
nitrocellulose filters. At low temperature, the oligonucleotide
will hybridize to both mutant and wild-type DNAs. As the
temperature is increased, the mismatched oligonucleotide
hybridized to the wild-type plasmid DNA begins to dissociate
from the wild-type clones. Eventually a temperature is reached
at which the mismatched oligomers completely dissociate from
the wild-type clones but remain hybridized to the mutants.
Since the oligonucleotide is radioactively labeled, the
nitrocellulose filter is exposed to x-ray film and mutant
clones are identified by the presence of a strong signal on the
autoradiograph. Mutant plasmid DNA is then isolated from the
corresponding colony on the master plate, using the replica
filter as a guide.



is preferentially replicated (Figure 11-10). In a second
method, the template strand is enzymatically destroyed before
transformation. Both methods can yield mutants at a frequency
of greater than 50 percent, so that plasmid DNA is simply
isolated from three or four randomly picked colonies and
analyzed by DNA sequencing with the expectation that a mutant
will be found among the DNA selected.
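
Why three or four colonies suffice at a mutant frequency of 50 percent follows from a short probability calculation. The sketch assumes that each picked colony is an independent draw with the same chance of carrying the mutant plasmid. The function names and the 1 percent comparison value are illustrative assumptions.

```python
from math import ceil, log

def p_at_least_one_mutant(mutant_fraction, colonies_picked):
    """Chance that at least one of the picked colonies carries the mutant."""
    return 1.0 - (1.0 - mutant_fraction) ** colonies_picked

def colonies_needed(mutant_fraction, confidence=0.95):
    """Smallest number of colonies giving at least `confidence` of one mutant."""
    if not 0.0 < mutant_fraction < 1.0:
        raise ValueError("mutant_fraction must be strictly between 0 and 1")
    return ceil(log(1.0 - confidence) / log(1.0 - mutant_fraction))

print(p_at_least_one_mutant(0.5, 3))   # 0.875
print(p_at_least_one_mutant(0.5, 4))   # 0.9375
print(colonies_needed(0.5))            # colonies to pick with enrichment
print(colonies_needed(0.01))           # colonies to pick at 1 percent mutants
```

At low mutant frequencies hundreds of colonies must be examined, which is the situation that calls for hybridization screening rather than direct sequencing.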



Oligonucleotide Cassettes Provide a 
Simple Method for Introducing 
Directed Mutations

We learned earlier that restriction enzyme sites pro- 
vide access to a cloned DNA for mutagenesis. If two 
restriction sites are close together, the intervening 
fragment can be removed and replaced with a synthetic 
double-stranded fragment (a cassette) made from two 
complementary single-stranded oligonucleotides car- 
rying any desired sequence. Often, however, conve- 
nient restriction sites are not available; fortunately, it 
is a simple matter to create them using the oligonu- 
cleotide-directed mutagenesis procedures described in 
the previous sections. Once the sites are in place, any 






FIGURE 11-10

Enrichment for oligonucleotide-directed mutants by using a
uracil-containing template. Single-stranded template DNA is
prepared in a strain of E. coli that lacks the enzyme uracil
deglycosidase (ung-), so that it contains several uracil
residues in place of thymines. (Although uracil is not usually
incorporated into DNA, it is not actually mutagenic and it does
form a base-pair with adenine.) The mutagenic oligonucleotide
is annealed and primes the synthesis of a strand that extends
around the template in a reaction using the four standard dNTPs
(as in Figure 11-8). Following ligation, the heteroduplex DNA
molecules are introduced into an ung+ strain of E. coli. Once
in the cell, the wild-type (template) strand is attacked by
uracil deglycosidase, which causes breaks in the DNA strand,
and the DNA strand is degraded before it can be replicated.
Since the strand containing the mutagenic oligonucleotide does
not contain uracil, it is not attacked and is replicated
normally. When this procedure is used, 50 percent or more of
colonies contain mutant plasmids.







number of new mutants can be made by inserting 
synthetic fragments into the plasmid (Figure 11-11), 
just as different cassettes can be inserted into a tape 
player. 
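
Cassette replacement can be pictured as a string operation: cut at two unique sites, discard what lies between, and drop in the synthetic fragment. The sketch below treats the plasmid as a linear string and ignores the geometry of the sticky ends. The toy plasmid and cassette sequences and the function name are assumptions, while GAATTC and AAGCTT are the recognition sequences of EcoRI and HindIII.

```python
ECORI = "GAATTC"     # EcoRI recognition sequence
HINDIII = "AAGCTT"   # HindIII recognition sequence

def replace_cassette(plasmid, site_a, site_b, cassette):
    """Swap the sequence between two unique restriction sites for `cassette`.

    Both recognition sites are kept, so further cassettes can be inserted later.
    """
    if plasmid.count(site_a) != 1 or plasmid.count(site_b) != 1:
        raise ValueError("each restriction site must occur exactly once")
    a_end = plasmid.index(site_a) + len(site_a)
    b_start = plasmid.index(site_b)
    if a_end > b_start:
        raise ValueError("site_a must lie upstream of site_b")
    return plasmid[:a_end] + cassette + plasmid[b_start:]

# Hypothetical plasmid region: flank, EcoRI, wild-type fragment, HindIII, flank.
plasmid = "TTGACA" + ECORI + "ATGGCCCGT" + HINDIII + "CCTGA"

# Any number of mutants, one per synthetic cassette.
cassettes = ["ATGTGGCGT", "ATGGCCAAA"]
mutants = [replace_cassette(plasmid, ECORI, HINDIII, c) for c in cassettes]
for m in mutants:
    print(m)
```

Because the wild-type fragment is physically removed, every recombinant carries the cassette sequence; there is no heteroduplex to resolve.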

This method of cassette mutagenesis was the basis for an
elegant experiment that verified a structural model for DNA
recognition by phage repressors. The repressors of the λ-like
phages 434 and P22 contain a helix-turn-helix structure (see
Chapter 9) that recognizes the operator DNA in the phage
genome. It was hypothesized that amino acid side chains on one
face of an α helix in the repressor protein make
sequence-specific contacts with operator DNA. To test this
hypothesis, a helix swap was performed (Figure 11-12).
Oligonucleotides were synthesized that encoded the amino acids
of the helix in the 434 repressor, with the five positions
thought to contact DNA changed to those found in the P22
repressor. This synthetic fragment was swapped for the natural
fragment in the 434 gene. The resulting hybrid protein gained
the recognition specificity of the P22 repressor, demonstrating
that this helix indeed contacts the DNA.

Cassette mutagenesis with degenerate oligonucleo- 
tides can be used to create a large collection of random 
mutations in a single experiment. This method was 








FIGURE 11-11 

Mutagenesis by cassette replacement. Plasmid DNA is cleaved
with restriction enzymes EcoRI and HindIII, which cut at sites
that flank the sequence to be mutated. The small cleaved DNA
fragment containing a portion of the wild-type sequence is
removed, and a DNA fragment (cassette) containing the desired
mutation is ligated into the plasmid. This mutant DNA fragment
is composed of two complementary synthetic oligonucleotides
that have EcoRI and HindIII sticky ends when annealed. Because
there is no heteroduplex intermediate (the mutant cassette is
simply swapped for the wild-type fragment), the recombinant
plasmids are all mutants. A mutant cassette can be composed of
degenerate oligonucleotides (see Chapter 7), resulting in a
library of mutant plasmids containing different sequences.



used to study the structure of the glucocorticoid response
element (GRE), an enhancer sequence that activates a family of
genes in response to certain steroid hormones. The element had
been mapped by deletion mutagenesis to a 30-bp region in a
glucocorticoid-regulated gene. To define the sequence required
for GRE function precisely, single point mutations throughout
the 30-bp region were generated and tested in cells for
inducibility by glucocorticoid hormone. Two complementary
oligonucleotides were synthesized that carried the 30-bp GRE,
but synthesis was performed under conditions in which incorrect
nucleotides were incorporated at a low frequency (Figure
11-13). These "doped" oligonucleotides (that is,
oligonucleotides produced by doping; see Figure 11-13) were
annealed and inserted as a cassette into a promoter that lacked
a GRE. Using this method, most single-nucleotide substitutions
at the 30 positions were obtained. Such a collection of mutants
would have been unthinkable before oligonucleotides
revolutionized in vitro mutagenesis.



Gene Synthesis Facilitates Production
of Normal and Mutant Proteins



The oligonucleotide-directed mutagenesis methods we have
described use a single oligonucleotide or a pair of
complementary oligonucleotides to insert mutant sequences into
an otherwise natural DNA fragment. With the increasing
availability of longer oligonucleotides, it is now possible to
construct an entire gene from synthetic oligonucleotides,
typically 40 to 80 nucleotides in length, that can be annealed
and ligated to assemble an entire double-stranded DNA molecule.






FIGURE 11-12 

The helix swap experiment. Amino acids in the phage 434
repressor protein believed responsible for recognition of the
434 operator were changed by cassette mutagenesis (Figure
11-11) of 434 DNA to the amino acids believed to perform the
same function in an analogous region of phage P22 repressor
protein. (a) Expression in E. coli of the 434 repressor protein
(left), with an enlargement of the site believed to bind the
434 operator; (right) the corresponding section of the P22
repressor protein. (b) A cassette was synthesized resembling
the 434 domain, but with P22-type substitutions at positions
thought to be essential for recognizing P22 operator DNA. This
was ligated into the digested 434 plasmid, and the recombinant
vector was introduced into E. coli to produce the hybrid
protein, which then recognized P22 operator DNA but not the 434
operator.







FIGURE 11-13 

Cassette mutagenesis using "doped" oligonucleotides to generate
numerous mutants in a single experiment. An oligonucleotide
cassette encoding the glucocorticoid response element (GRE) was
synthesized by a DNA synthesis machine. Synthesis was done
under conditions in which each bottle containing a particular
nucleotide precursor was "contaminated" (doped) with small
amounts of the other three precursors. In the example above,
the DNA synthesizer was instructed to make an oligonucleotide
with the sequence GGTTACAAACT. Thus, when a nucleotide
precursor is called for (a C, for example), the machine adds an
aliquot of the solution from the C bottle, and a C base is
coupled to the end of most of the oligonucleotide chains.
However, because the C bottle contains a small amount of A, G,
and T, an incorrect base is sometimes added instead. Since the
concentration of C is roughly 30 times that of A, G, and T, an
incorrect base will be added to about 1 out of 30 molecules.
This results in a doped collection of oligonucleotides, which
actually consists of many different sequences, some wild-type
and some with substitutions. The level of contamination was
adjusted to favor synthesis of oligonucleotides with only one
substitution, but because substitutions occur randomly, some
molecules in the collection had none and others had two or
more. Cassettes were formed by annealing complementary doped
oligonucleotides and ligated into a vector. Plasmid DNA was
isolated from 546 individual E. coli transformants and analyzed
by sequencing. Of these, 224 were wild-type, 218 contained one
substitution (for the 30 bases of interest, 74 of the 90
possible single substitutions were recovered), and the rest
contained two or more.
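
The mix of wild-type, single, and multiple mutants reported for the doping experiment is roughly what a simple binomial model predicts. The sketch below assumes that each of the 30 positions is miscoupled independently with probability 1/30; that is a simplification of the chemistry, adopted only to show the arithmetic. The observed counts are those given in the caption.

```python
from math import comb

def p_substitutions(k, length=30, error_rate=1 / 30):
    """Binomial probability that a doped oligo carries exactly k substitutions."""
    return comb(length, k) * error_rate**k * (1 - error_rate) ** (length - k)

p_none = p_substitutions(0)
p_one = p_substitutions(1)
p_two_or_more = 1 - p_none - p_one

# Counts reported for the 546 sequenced plasmids.
total = 546
observed = {
    "none": 224 / total,
    "one": 218 / total,
    "two or more": (total - 224 - 218) / total,
}
expected = {"none": p_none, "one": p_one, "two or more": p_two_or_more}

for label in observed:
    print(f"{label:12s} expected {expected[label]:.2f}  observed {observed[label]:.2f}")

# Each of the 30 positions can change to any of 3 other bases.
print("possible single substitutions:", 30 * 3)
```

Under this model no choice of doping level makes single mutants the overwhelming majority. Raising the error rate to obtain more mutants also raises the share with two or more changes, which is why every plasmid is sequenced before testing.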



With gene synthesis, the experimenter has total control over
the sequence of the gene. It can be wild-type or altered in any
way required. Because most amino acids are encoded by multiple
triplet codons, genes encoding wild-type proteins can be
constructed using different codons. Codons can be chosen to
place unique restriction sites throughout the sequence so that
mutant cassettes can be easily swapped in. This was done with
the bacterial rhodopsin gene. Replacing a fragment of the
synthetic gene with a new synthetic fragment identified the
amino acid that is linked to the photon-absorbing chromophore
that initiates photosynthesis. Other fragments can be exchanged
as cassettes to study other important structural features of
the protein.
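
The layout of oligonucleotides for gene synthesis, in which each one bridges two neighbors on the opposite strand (Figure 11-14), can be sketched as a staggered tiling. This is only one possible layout, assumed for illustration; real designs also weigh melting temperatures and secondary structure. The function names and the toy sequence are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def revcomp(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

def tile_gene(top_strand, oligo_len=40):
    """Split a gene into top-strand oligos and staggered bottom-strand oligos.

    Bottom oligos are offset by half an oligo length, so each internal one
    spans the junction between two adjacent top oligos. Bottom oligos are
    listed in order along the top strand, each written 5'->3'.
    """
    if oligo_len < 2 or oligo_len % 2:
        raise ValueError("oligo_len must be an even number of at least 2")
    n = len(top_strand)
    top = [top_strand[i:i + oligo_len] for i in range(0, n, oligo_len)]
    half = oligo_len // 2
    cuts = [0] + list(range(half, n, oligo_len)) + [n]
    bottom = [revcomp(top_strand[a:b]) for a, b in zip(cuts, cuts[1:])]
    return top, bottom

gene = "ATGGCTAGCAAAGGTGAAGAACTG"        # hypothetical 24-nucleotide "gene"
top, bottom = tile_gene(gene, oligo_len=8)
print(top)
print(bottom)
```

Annealing brings the staggered pieces together, and ligase then seals the nicks on each strand to give two contiguous strands.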

Codons can also be changed by gene synthesis to allow
production of proteins at high levels in other organisms.
Studies of the biochemistry of the Fos protein, encoded by a
cellular protooncogene in animal






FIGURE 11-14

Gene synthesis by ligation of complementary oligonucleotides.
To synthesize a gene that encodes a protein of interest, a set
of overlapping complementary oligonucleotides is designed that
can be combined to form a double-stranded DNA molecule that
encodes the entire protein. The oligonucleotides are mixed
together, heated at 90°C for a few minutes to denature the
strands, and then cooled slowly to room temperature. During
this period the oligonucleotides anneal through complementary
base pairs. The oligonucleotides are designed so that each one
anneals to two adjacent oligonucleotides from the opposite
strand, bridging them. Generally, oligonucleotides ranging in
length from 40 to 80 nucleotides are used in gene synthesis.
The annealed oligonucleotides are covalently linked by DNA
ligase, producing two contiguous DNA strands. This synthetic
gene is usually purified from a gel before ligation into a
vector. The resultant recombinant plasmid is obtained following
transformation into E. coli and is sequenced to check that the
correct sequence was synthesized. The sequence of the synthetic
gene can be designed to place restriction sites at convenient
locations for cassette mutagenesis.



cells (Chapter 18), have been severely hampered by the
inability to produce the protein in E. coli. This problem was
finally solved by synthesizing a portion of the fos gene
entirely from oligonucleotides, changing natural fos codons to
the codons used most efficiently in E. coli. Insertion of this
synthetic gene into an E. coli expression vector allowed for
the first time the production of large quantities of active Fos
protein. The gene was also designed with several unique
restriction sites so that efficient cassette mutagenesis can
now be coupled to the biochemical assays for Fos function.
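
Recoding a gene with different codons while leaving the protein unchanged can be shown with a toy example. The codon assignments below are from the standard genetic code, but the "preferred" table is a hypothetical illustration, not a measured E. coli codon-usage table, and the short sequence is invented rather than taken from fos.

```python
# Standard genetic code entries for the codons used in this example.
CODON_TO_AA = {
    "ATG": "M", "CTG": "L", "TTA": "L", "AAA": "K", "AAG": "K",
    "GAA": "E", "GAG": "E", "CGT": "R", "AGA": "R", "TAA": "*",
}

# Hypothetical choice of one codon per amino acid for the expression host.
PREFERRED = {"M": "ATG", "L": "CTG", "K": "AAA", "E": "GAA", "R": "CGT", "*": "TAA"}

def codons(dna):
    """Split a coding sequence into triplets."""
    if len(dna) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]

def translate(dna):
    """Translate a coding sequence using the small table above."""
    return "".join(CODON_TO_AA[c] for c in codons(dna))

def recode(dna):
    """Replace every codon by the preferred synonym for the same amino acid."""
    return "".join(PREFERRED[CODON_TO_AA[c]] for c in codons(dna))

natural = "ATGTTAAAGGAGAGATAA"      # invented example: M L K E R stop
synthetic = recode(natural)
print(synthetic)
print(translate(natural), translate(synthetic))
```

The same freedom to pick among synonymous codons is what allows unique restriction sites to be written into a synthetic gene without altering the encoded protein.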



The PCR Can Be Used to Construct
Genes Encoding Chimeric Proteins 

The ease with which mutations can be made in a protein coding
sequence has revolutionized the study of protein function. A
functional domain can be identified by making a series of
mutant proteins, then testing which substitutions cause a
change in function. However, it is not often easy to decide
where to make a mutation. In the example of the helix swap
experiment (Figure 11-12), the domain that bound DNA had been
previously identified. And the design of the experiment was
guided by having a model for the three-dimensional structure of
the repressor protein.






However, for most proteins, little structural information is
available. Identifying a functional domain (for example, a
region of the protein that may interact with another protein)
is difficult to do by inspecting the primary amino acid
sequence. A simple strategy







FIGURE 11-15 

Construction of a chimeric antibody heavy-chain-encoding gene
by "sticky feet-directed" mutagenesis. Antibodies containing a
γ2b heavy chain are known to participate in
complement-dependent cell lysis, whereas antibodies containing
γ1 chains do not. In order to identify which domain of the γ2b
heavy chain is responsible for this property, an antibody with
a chimeric heavy chain was made. To construct a gene encoding
the chimeric heavy chain, the fragment encoding the CH2 domain
from a γ1 heavy chain was replaced with the homologous segment
from a γ2b gene. Since there were no convenient restriction
sites at the ends of the CH2 segments, the 400-nucleotide-long
γ2b DNA was prepared by PCR. The PCR primers were complementary
to the ends of the γ2b DNA but contained additional nucleotides
(the sticky feet) that were complementary to γ1 DNA at the
boundaries of the γ1 CH2 domain. The strands of the
PCR-generated fragment were separated by heating, and one
strand was used as the primer in a mutagenesis experiment using
a uracil-containing single-stranded γ1 DNA template by the
method shown in Figure 11-10. The resulting chimeric heavy
chain gene was coexpressed with a light chain gene in mammalian
cells to form an antibody that now activated complement. Since
only the CH2 domain is from the γ2b heavy chain, this result
demonstrated that the γ2b CH2 domain contains the information
necessary to activate complement-dependent cell lysis. Sticky
feet-directed mutagenesis provided a simple means for
constructing this complicated gene.






that helps to narrow down important amino acids in a protein is
the analysis of chimeras between related proteins. We have
previously discussed the use of computer programs to identify
related proteins by comparison of their amino acid sequences
(Chapter 8). Chimeric proteins are constructed by replacing a
segment of one protein with the homologous segment from another
protein. Although the two proteins have functional differences,
their sequence similarity often indicates that they share a
common overall structure. A striking example of this was in the
analysis of human growth hormone (hGH). A series of chimeric
proteins were made in which most of the amino acids were
derived from hGH but which contained segments from related
hormones, such as human prolactin. Using this strategy, regions
of hGH that interact with the hGH receptor were identified. In
Chapter 17, we will see how functional regions of a receptor
which spans the membrane seven times were identified by the
study of chimeras.

The 434/P22 repressor (Figure 11-12) and hGH chimeras were
constructed by ligation of short oligonucleotide cassettes into
the coding sequence. A different strategy (Figure 11-15) was
used to prepare a chimeric antibody in which a 400-bp segment
from a γ1 heavy-chain gene was replaced by the homologous
segment from a γ2b gene. A 400-bp DNA fragment was generated by
PCR that encoded the new sequence to be inserted and two
30-base "sticky feet" on each end. The double-stranded PCR
fragment was heated to denature the two strands, and then one
of the single-stranded molecules was utilized in a primer
extension experiment (as in Figure 11-8). Had the gene
synthesis method been employed, construction of the chimeric
gene would have required twenty 40-nucleotide-long oligomers.
Instead, the sticky feet method used only two oligonucleotide
primers for PCR.
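
The comparison between the two routes can be made concrete. The sketch below first reproduces the count of twenty oligomers for a 400-bp double-stranded segment, then assembles a pair of sticky-feet primers from toy sequences. The 20-nucleotide priming length, the placeholder sequences, and the function names are assumptions; only the 400-bp insert, the 40-nucleotide oligomers, and the 30-base feet come from the text.

```python
from math import ceil

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def revcomp(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

def oligos_for_gene_synthesis(length_bp, oligo_len):
    """Oligos needed to cover both strands of a segment end to end."""
    return ceil(2 * length_bp / oligo_len)

def sticky_feet_primers(insert, left_foot, right_foot, priming_len=20):
    """PCR primers: a foot matching the recipient gene plus a priming region."""
    if len(insert) < priming_len:
        raise ValueError("insert is shorter than the priming region")
    forward = left_foot + insert[:priming_len]
    reverse = revcomp(insert[-priming_len:] + right_foot)
    return forward, reverse

def pcr_product(insert, left_foot, right_foot):
    """Top strand of the amplified fragment, feet included."""
    return left_foot + insert + right_foot

print(oligos_for_gene_synthesis(400, 40))    # 20

insert = "ACGT" * 100                        # placeholder 400-nt donor segment
left_foot = "G" * 15 + "A" * 15              # placeholder 30-nt recipient flank
right_foot = "C" * 15 + "T" * 15             # placeholder 30-nt recipient flank
forward, reverse = sticky_feet_primers(insert, left_foot, right_foot)
product = pcr_product(insert, left_foot, right_foot)
print(len(forward), len(reverse), len(product))
```

One strand of this product, annealed through its two feet to the single-stranded recipient gene, then serves as a very long mutagenic primer.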

Mutagenesis Is the Gateway to Gene 
Function and Protein Engineering 

It would be difficult to overestimate the importance of in
vitro mutagenesis techniques to biology and biotechnology. The
harnessing of enzymes that operate on DNA and the refinement of
oligonucleotide synthesis have made changing gene sequences an
almost trivial task. And the ability to operate on DNA lets us
also change the structure of the products of genes: RNA and,
most importantly, proteins. Thus, the impact of this technology
is twofold. It has revolutionized how research is done in
molecular biology by creating the entirely new concept of
"reverse genetics": changing gene sequence first, then
examining gene function. And it opens the door to sophisticated
protein engineering (see Chapter 23), the ability to make
changes in natural gene products that make them do their jobs
better. The impact of protein engineering on medicine and
industry will be substantial.







CHAPTER 23

Recombinant DNA in Medicine and Industry



As soon as the first successful cloning experiments were reported in 1973,
applications for this powerful technology quickly followed. The significance
of being able to produce large quantities of human proteins that were
normally available in exceedingly small amounts, if at all, was not lost on
scientists, physicians, and businessmen alike. In 1976 biotechnology became a
reality as the methodologies for DNA cloning, oligonucleotide synthesis, and
gene expression converged in a single experiment, in which a human protein
was expressed from recombinant DNA for the first time. The protein was
somatostatin, a 14 amino acid peptide neurotransmitter. The gene encoding
somatostatin was not the natural gene but was synthesized chemically and cloned
into a plasmid vector for expression in E. coli. Soon after followed the successful
expression of human insulin for the treatment of diabetes, the first commercial
product of the biotechnology industry. Instead of insulin extracted from the
pancreases of pigs and cows, diabetics could now receive insulin identical to
that normally produced by humans.

The ability to achieve such feats relied on the successes in all areas of molecular
biology, including oligonucleotide synthesis, isolation of enzymes that cleave
and join DNA, characterization of bacterial plasmids, and an understanding of
gene expression. These methods have, of course, revolutionized research in
biology and medicine, but what is equally important, they have spawned an
entirely new industry, one devoted to the cloning and production of proteins of
importance to both medicine and industry. Today, proteins are produced through
recombinant DNA technology for treatment of numerous diseases (cancer,
allergies, autoimmune disease, neurological disorders, heart attacks, blood
disorders, infections, wounds, and genetic diseases) as well as for more
prosaic tasks, such as use in laundry detergents and food production. In
addition, entirely new approaches to drug design have emerged from recombinant
DNA technology, as scientists have gained the ability to tinker with natural
proteins to improve their function and to change them in subtle and useful ways.



Expression Systems Are Developed to 
Produce Recombinant Proteins 

Cloning the gene or cDNA encoding a particular
protein is only the first of many steps needed to produce
a recombinant protein for medical or industrial use.
The next step is to put the gene into a host cell for
production. The development of expression systems
has been an important research area in both industrial
and academic laboratories. The most popular
expression systems are the bacteria E. coli and Bacillus subtilis,
yeast, and cultured insect and mammalian cells. We
have learned in earlier chapters about the development
of vectors and DNA transformation methods for these
organisms. Here we will discuss the issues that are
important for protein production. The choice of which
cell is used depends on the project goals and on the
properties of the protein to be produced.

Bacterial cells offer simplicity, short generation
times, and large yields of product with low costs. And,
especially with B. subtilis, the cells can be induced to
secrete the product into the culture medium, thus
greatly simplifying the task of purification. But
expression in prokaryotic cells has several drawbacks.
Although some proteins are expressed to high levels
(greater than 10 percent of the mass of all bacterial
proteins), they often fail to fold properly and hence
form insoluble inclusion bodies. Protein extracted from
these inclusion bodies is often biologically inactive.
Small proteins can sometimes be refolded into their
active forms, but larger proteins usually cannot. A
second problem is that foreign proteins are sometimes
toxic to bacteria, so cell cultures producing the protein
cannot be grown to high densities. This problem can
often be circumvented by using an inducible promoter
that is turned on to begin transcription of the gene
for the foreign protein only after the culture has been
grown. Third, bacterial cells lack enzymes that are
present in eukaryotic cells and add posttranslational
modifications, such as phosphates and sugars, to
proteins. These modifications are often required for
proper functioning of proteins. Researchers are
addressing this problem by purifying the eukaryotic
enzymes that carry out these modifications and using
these enzymes to add the needed modifications to
bacterially expressed proteins.

Yeast has been used for centuries by brewers and
bakers, and now it toils for biotechnologists as well.
As discussed in Chapter 13, yeast is a simple eukaryote
that resembles mammalian cells in many ways but can
be grown as quickly and cheaply as bacteria can. Yeast
perform many of the posttranslational modifications
found on human proteins and can be induced to secrete
certain proteins into the growth medium for harvesting.
One disadvantage of yeast is the presence of active
proteases that degrade foreign proteins, thereby
reducing the yield of product. Researchers are dealing
with this problem, however, by constructing yeast
strains in which the protease genes have been deleted.

Expression of heterologous proteins in insect cells
by baculovirus vectors (as previously described in
Figure 12-12) is a relatively new approach. The main
advantages are high-level expression, correct folding,
and posttranslational modifications similar to those in
mammalian cells. A vaccine for the AIDS virus has
been prepared by producing one of the HIV
glycoproteins with this system. Although the cost of
culturing insect cells is currently more than that for
culturing bacteria and yeast, it is less than that for
culturing mammalian cells.

Despite the significant advantages of producing
human proteins in heterologous host cells, in some cases
the best place to produce a mammalian protein is in
mammalian cells. Great improvements have been
made to promoters, vectors, transformation protocols,
and host cell systems. Transient expression in
mammalian cells (described in Figure 12-4) is often used
for checking the function of a newly cloned gene and
as a quick method for assessing the function of
engineered proteins. The extracellular domains of
cell-surface receptors (Chapter 17) have been engineered
for secretion from cells by introducing a stop codon
into the gene before the transmembrane domain
sequence. These soluble receptors are valuable reagents
for studying ligand binding in vitro and for screening
for receptor agonists or antagonists, and they may
eventually be used as therapeutics themselves.
Although transient systems yield enough protein for
laboratory experiments, stably integrated amplified genes
in mammalian cells are used for the large-scale
production of proteins such as tissue plasminogen
activator, which we describe later.
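
The idea of making a soluble receptor by placing a stop codon ahead of the transmembrane domain can be illustrated with a toy sequence. The Python sketch below is only an illustration: the coding sequence, the position of the "transmembrane" codons, and the function names are all hypothetical, and the standard genetic code is assumed.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def insert_stop_before(cds: str, codon_index: int, stop: str = "TGA") -> str:
    """Insert a stop codon in frame, just before the given codon (0-based)."""
    cut = 3 * codon_index
    return cds[:cut] + stop + cds[cut:]


# Hypothetical toy gene: 4 "extracellular" codons, 4 hydrophobic
# "transmembrane" codons, 2 "cytoplasmic" codons, then a natural stop.
toy_cds = "ATGGCTGAACCG" + "CTGCTGGTGCTG" + "AAACGT" + "TAA"

full_length = translate(toy_cds)                    # membrane-anchored form
soluble = translate(insert_stop_before(toy_cds, 4)) # extracellular part only

print(full_length, soluble)
```

In this toy case translation of the engineered gene ends after the fourth codon, so only the extracellular segment is made; a real construct would of course depend on the actual domain boundaries of the receptor.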



Insulin Is the First Recombinant Drug 
Licensed for Human Use 

The first licensed drug produced through genetic
engineering was human insulin. An important hormone
that regulates sugar metabolism, insulin is produced
by a small number of cells in the pancreas and secreted
into the bloodstream. An inability to produce insulin
results in diabetes, but daily injections of insulin are
sufficient to reverse or at least allay the debilitating
effects of the disease. Prior to production of the
recombinant molecule, insulin for treatment of diabetes
was obtained from the pancreases of pigs and cows.
Although this insulin is biologically active in humans,
the amino acid sequences are not identical to that of
the human molecule. Thus, some patients produced
antibodies against injected insulin, occasionally
resulting in serious immune reactions. Because
recombinant human insulin is identical to the natural
product, immunogenicity should not be a problem.

In mammals, insulin is expressed as a single-chain
prepro-hormone, which is secreted through the plasma
membrane. A prepro-hormone contains extra amino
acids not present in the mature hormone.
Amino-terminal amino acids form the pre sequence and target
the expressed protein for secretion. The pro sequence
is a stretch of amino acids in the middle of the hormone
sequence that is important for folding the polypeptide
chain into the correct structure. During secretion,
these extra amino acids are cleaved from the
prepro-hormone by cellular proteases to release the mature
insulin molecule, consisting of two short polypeptide
chains, A and B, linked by two disulfide bonds. The
principal challenge in the production of recombinant
insulin was getting insulin assembled into this mature
form. The initial approach was to construct synthetic
genes from oligonucleotides that separately encoded
the A and B chains. These were individually inserted
into the E. coli gene encoding β-galactosidase, so the
bacteria produced large fusion proteins that had the
insulin sequences tacked onto the end of the
β-galactosidase enzyme (Figure 23-1). These large
proteins were purified from bacterial extracts, and the
insulin chains were released by treatment with
cyanogen bromide, a chemical that cleaves peptide bonds
following methionine residues. Because a methionine
codon had been inserted at the boundaries between
β-galactosidase and the insulin chains in the fusion
proteins, cyanogen bromide treatment clipped intact
insulin chains off the fusion proteins. These were
purified, mixed, and reconstituted into an active insulin
molecule. This approach was refined by producing a
single β-galactosidase-insulin fusion protein, which
could be cleaved in a single step to release mature
insulin. A similar method is now in use for the
commercial production of recombinant insulin.
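
The logic of the cyanogen bromide step can be sketched in a few lines of Python. This is an illustrative model only: the carrier sequence is a short hypothetical stand-in for β-galactosidase, the insulin chain sequences are the commonly tabulated human A and B chains (worth checking against a sequence database before reuse), and the chemistry is simplified to "cut after every M" (in reality the methionine is converted to a homoserine derivative during cleavage).

```python
# Commonly tabulated human insulin chains (one-letter code).
INSULIN_A = "GIVEQCCTSICSLYQLENYCN"           # 21 residues
INSULIN_B = "FVNQHLCGSHLVEALYLVCGERGFFYTPKT"  # 30 residues

# Hypothetical short carrier standing in for beta-galactosidase.
TOY_CARRIER = "MKTAYMLSDE"


def cnbr_cleave(protein: str) -> list[str]:
    """Split a protein after every methionine (simplified CNBr cleavage)."""
    fragments = []
    start = 0
    for index, residue in enumerate(protein):
        if residue == "M":
            fragments.append(protein[start:index + 1])
            start = index + 1
    if start < len(protein):
        fragments.append(protein[start:])
    return fragments


def make_fusion(carrier: str, chain: str) -> str:
    """Join carrier and insulin chain with a methionine at the boundary."""
    return carrier + "M" + chain


fragments_a = cnbr_cleave(make_fusion(TOY_CARRIER, INSULIN_A))
fragments_b = cnbr_cleave(make_fusion(TOY_CARRIER, INSULIN_B))

# The last fragment of each digest is the intact insulin chain; the
# carrier is chopped into smaller pieces at its internal methionines.
print(fragments_a)
print(fragments_b)
```

Because neither insulin chain contains an internal methionine, each chain survives the digest as a single fragment, while the carrier is broken up wherever it contains methionine.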



Recombinant Human Growth
Hormone Is Produced in
Bacteria by Two Methods

Growth hormone is a 191 amino acid protein that is
produced in the pituitary gland and regulates growth
and development. Children born with growth hormone
deficiency (hypopituitary dwarfs) never
achieve normal stature. Regular injections of growth
hormone stimulate the growth of these children so
that they reach near-normal heights. Unlike the
situation with insulin, animal-derived growth hormones
are ineffective. Only the human protein works, and
for many years it was painstakingly extracted from the
pituitaries of human cadavers. One unforeseen and
unfortunate consequence of growth hormone treatment,
however, was the infection of a number of children
with a fatal virus from one of the cadavers.
Production of recombinant human growth hormone
(hGH) would clearly provide a safe, reliable, and
plentiful source of this drug.

FIGURE 23-1

Expression of human insulin in E. coli. Recombinant insulin
was first made by expressing the A and B chains separately,
then refolding them into a mature insulin molecule. A DNA
fragment encoding each insulin chain was made by annealing
two complementary oligonucleotides that had been
chemically synthesized. Each fragment was ligated into a
bacterial expression vector so that, when translated, the
insulin chain would be fused to the carboxy terminus of the
enzyme β-galactosidase (β-gal). The expression vectors
were transformed into E. coli, and the β-gal-insulin fusion
proteins accumulated inside the bacterial cells. The cells
were harvested, and each β-gal-insulin fusion protein was
purified. The insulin-coding DNA was synthesized so that it
started with a methionine codon. This setup provided a way
to cleave off the β-gal part from the insulin polypeptide.
Treatment of the fusion protein with the chemical cyanogen
bromide (CNBr) results in cleavage of peptide bonds after
all methionines. In this way, the natural insulin peptides
were obtained. Because β-gal contains other methionine
residues, CNBr treatment cleaved it into many small peptides.
The insulin chains were not cleaved further because they
did not contain internal methionines. The A and B chains
were purified and then mixed together to form active
recombinant insulin.

The initial production of hGH was achieved by
constructing a hybrid gene from the natural hGH
cDNA and synthetic oligonucleotides that encoded
the amino terminus of the mature form of the protein
(Figure 23-2a). This coding sequence was ligated into
a plasmid adjacent to a bacterial promoter. Like insulin,
hGH is normally produced as a larger precursor
protein containing an amino-terminal signal sequence.
Because the human signal sequence would not be
recognized by the bacterial secretion machinery, the 5'
end of the cDNA was reengineered with a synthetic
DNA sequence enabling the bacteria to produce a
nearly normal version of the mature human protein.

FIGURE 23-2

Bacterial production of human growth hormone (hGH). (a) An expression vector was
constructed for intracellular production of hGH. The coding sequence was constructed by
isolating from the cDNA a DNA fragment that encoded amino acids 24-191 and ligating this
to a synthetic oligonucleotide fragment that encoded amino acids 1-24. Following
introduction of the expression vector into bacterial cells, recombinant hGH was produced inside the
cells. The expressed protein behaved just like natural human growth hormone but contained
the initiator methionine at the amino terminus. (b) A protein can be produced in bacteria
without this extra methionine by targeting it for secretion. To do this, a DNA fragment
encoding a bacterial signal sequence, which specifies secretion of a bacterial protein, was
placed in front of the hGH coding sequence. Upon introduction of this vector into bacteria,
hGH is produced, and the signal sequence targets the protein for secretion. The protein
accumulates in the periplasmic space between the inner and outer bacterial membranes and
can be released by hypotonic disruption of the outer membrane. In contrast to the
intracellular form of hGH, the protein produced by this procedure does not contain an initiator
methionine, since a periplasmic protease cleaved off the signal sequence.

The first hGH expression vectors directed the
production of the protein inside the cell. Purification
required many steps to separate hGH from the thousands
of intracellular bacterial proteins. Another way to
produce the protein in bacteria is to engineer the protein
so it is secreted. This can be done by linking the coding
sequence for the desired protein to a signal sequence
from a secreted bacterial protein, thus forming a
prehormone (Figure 23-2b). Human growth hormone is
produced by the bacteria and then secreted with the
concomitant removal of the signal peptide by a
bacterial protease. Secretion into the periplasm, where
there are fewer proteins than inside the cell, makes
purification simpler. The only difference between the
secreted hGH and that produced intracellularly is the
presence of an amino-terminal methionine on the
intracellularly expressed molecule. Because the secreted
form lacks this methionine, it is called met-less hGH.
Bacterially expressed hGH has been administered to
thousands of growth hormone-deficient children, who
have benefited greatly from this recombinant drug.
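
The difference between the two bacterial products can be modeled with placeholder strings. The Python sketch below is a hedged illustration: the "mature hGH" string is a stand-in of the right length (191 residues, as stated in the text) rather than the real sequence, the signal peptide is hypothetical, and signal removal is modeled as a simple cut at the signal/hormone junction.

```python
MATURE_LENGTH = 191  # length of mature hGH given in the text

# Placeholder sequences: neither is a real hGH or bacterial sequence.
mature_hgh = "X" * MATURE_LENGTH
signal_peptide = "MKKLLAVAALA"  # hypothetical bacterial signal sequence


def intracellular_form(mature: str) -> str:
    """Intracellular expression: translation starts at an added Met."""
    return "M" + mature


def secreted_form(signal: str, mature: str) -> str:
    """Secretion: the signal peptide is made, then cut off on export."""
    prehormone = signal + mature
    return prehormone[len(signal):]  # protease removes the signal peptide


met_hgh = intracellular_form(mature_hgh)
met_less_hgh = secreted_form(signal_peptide, mature_hgh)

print(len(met_hgh), len(met_less_hgh))
```

In this model the intracellular product is one residue longer than the natural hormone, whereas the secreted product is identical to it, which is the distinction the term met-less hGH captures.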



A Hepatitis B Virus Vaccine Is
Produced in Yeast by Expression
of a Viral Surface Antigen

One of the successes of modern medicine is the
development and implementation of vaccines against
infectious diseases. Prior to the advent of recombinant
DNA technology, two types of vaccines were used.
Inactivated vaccines are chemically killed derivatives
of the actual infectious agent. Attenuated vaccines are
live viruses or bacteria altered so that they no longer
multiply in the inoculated organism. Both types of
vaccines work by presenting surface proteins
(antigens) to B and T lymphocytes, which become primed
to respond rapidly should the organism actually
become infected, usually destroying the infectious agent
before any damage is done (Chapter 16). However,
these types of vaccines are potentially dangerous
because they can be contaminated with infectious
organisms. For example, a small number of children each
year contract polio from their polio vaccinations. Thus,
one of the most promising applications of recombinant
DNA technology is the production of subunit vaccines,
consisting solely of the surface protein to which the
immune system responds. With a subunit vaccine,
there is no risk of infection.

The first successful subunit vaccine was produced
for hepatitis B virus (HBV), which infects the liver
and causes liver damage and, in some cases, cancer.
The virus particle is coated with a surface antigen,
HBsAg, and infected patients carry large aggregates
of this protein in their blood. Early experiments
suggested that these aggregates would make a potent
vaccine, but how could they be produced in quantities
sufficient to vaccinate large populations against HBV?
With the cloning of the HBV genome, the possibility
of a subunit vaccine could be explored. Initial attempts
to produce the HBsAg protein in E. coli failed, so
researchers turned to yeast. The HBsAg gene was
inserted into a high-copy yeast expression vector
(Figure 15-3) and engineered, in this case, so that it would
not be secreted (Figure 23-3). Yeast transformed with
this plasmid produced large quantities of the viral
protein (about 1-2 percent of the total yeast protein).
By growing the yeast in large fermentors, it was
possible to produce 50-100 mg of the protein per liter
of culture. This recombinant protein closely resembled
the natural viral protein; it even formed aggregates
with properties similar to those of the immunogenic
aggregates found in infected patients. The yeast
protein is now used commercially to vaccinate people
against HBV infection.
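
To give a feel for what a yield of 50-100 mg per liter means at production scale, the Python sketch below works through the arithmetic. Only the per-liter yield range comes from the text; the fermentor volume, the dose size, and the purification recovery are hypothetical round numbers chosen for illustration and are not figures from any manufacturer.

```python
def protein_yield_grams(culture_liters: float, mg_per_liter: float) -> float:
    """Total antigen produced in one fermentor run, in grams."""
    return culture_liters * mg_per_liter / 1000.0


def doses_from_yield(total_grams: float,
                     dose_micrograms: float,
                     recovery_fraction: float) -> int:
    """Whole doses obtainable after purification losses."""
    recovered_micrograms = total_grams * 1_000_000 * recovery_fraction
    return int(recovered_micrograms // dose_micrograms)


# Assumed values (hypothetical): a 1000-liter fermentor, a 20-microgram
# dose, and 50 percent recovery during purification.
FERMENTOR_LITERS = 1000
DOSE_MICROGRAMS = 20
RECOVERY = 0.5

low = protein_yield_grams(FERMENTOR_LITERS, 50)    # lower bound from text
high = protein_yield_grams(FERMENTOR_LITERS, 100)  # upper bound from text

print(low, high)
print(doses_from_yield(low, DOSE_MICROGRAMS, RECOVERY),
      doses_from_yield(high, DOSE_MICROGRAMS, RECOVERY))
```

Under these assumptions a single run would supply on the order of a million or more doses, which illustrates why yeast fermentation made vaccination of large populations practical.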

Vaccines against many human and animal
pathogens are currently in various stages of development.
Recombinant DNA technology has provided a safe
means to work with and to inoculate children and
adults with only noninfectious parts of infectious
agents. In Chapter 25, we will discuss various strategies
for the development of a vaccine against the AIDS
virus.



Complex Human Proteins Are 
Produced by Large-Scale 
Mammalian Cell Culture 

Most of the recombinant proteins we have discussed
thus far in this chapter are relatively small and simple
in both structure and function. Other proteins of
medical interest are considerably more complicated in
structure and function, and biologically active proteins
have proved difficult to produce in bacteria and yeast.
In these cases, biotechnology companies have resorted
to using mammalian cells for protein production.
Mammalian cells are finicky and expensive to grow,
but they can be counted on to produce correctly
modified, fully active proteins. Thus, much effort in the
biotechnology industry has been devoted to setting up
fermentor systems for large-scale culture of
mammalian cells.

FIGURE 23-3

Production of a subunit vaccine in yeast. Hepatitis B virus
(HBV) is encoded by a small 3.2-kb genome that has been
cloned and sequenced. Both the whole virus and a smaller
HBsAg (HBV surface antigen) particle are found in the
blood of infected patients. To prepare a vaccine against
HBV, which has been difficult to propagate in culture, the
HBsAg gene was cloned into a vector for expression in the
yeast Saccharomyces cerevisiae. Transcription occurs from the
strong promoter from the gene encoding alcohol
dehydrogenase I. A transcription terminator was placed downstream.
The vector contains replication origins and markers for both
bacteria and yeast. Yeast transformed with this plasmid can
be grown to high cell densities in fermentors. This process
results in the accumulation of large amounts of HBsAg
protein, which upon purification was found to assemble
into particles about 20 nanometers in diameter, resembling
the particles found in HBV-infected patients.

The first drug to be produced commercially by
mammalian cell culture was tissue plasminogen activator
or tPA, which is administered to heart attack victims.
Tissue plasminogen activator is a protease, an enzyme
that cleaves other proteins. It works by clipping
plasminogen, an inactive precursor protein, to form plasmin,
itself a potent protease that degrades fibrin, the protein
that forms blood clots. Rapid administration of a
plasminogen activator after a heart attack dissolves the
life-threatening clots that lead to irreversible damage
of heart muscle. Tissue plasminogen activator is
commercially produced from a mammalian cell line
carrying a stably integrated, highly amplified expression
vector (Figure 23-4).

Another protein being produced by mammalian cell 
culture is Factor VIII, a protein required for normal 
clotting of the blood. Genetic defects in Factor VIII 
production, are responsible for hemophilia. For many 
years, hemophiliacs have been treated with Factor VIII 
purified from human blood. With the contamination 
of the human blood supply by the AIDS virus, how- 
ever, thousands of hemophiliacs became infected and 
hundreds died from AIDS. The Factor VIII cDNA 
had already been cloned before scientists found that 



[Figure 23-3, lower diagram. Expression vector labels: HBsAg coding sequence; yeast promoter; yeast transcription terminator; yeast replication origin; LEU2; bacterial replication origin; Amp^r. Steps: transform yeast cells; select cells that contain plasmid by growth on medium lacking leucine; culture cells in a fermentor; isolate cells by centrifugation; break open yeast cells; purify HBsAg particles.]






the blood supply was contaminated with the AIDS 
virus. Recognition of the need for a safer source of 
Factor VIII accelerated efforts already under way to
produce the protein using recombinant DNA methods.
Like tPA, Factor VIII is a large and complex protein
and can only be efficiently produced in mammalian
cell culture. But the availability of recombinant protein
will spare future generations of hemophiliacs from 
infectious agents that contaminate the blood supply. 






Monoclonal Antibodies Function 
as "Magic Bullets"

We have discussed the use of biotechnology to pro- 
duce novel vaccines that elicit antibody production 
by the body's immune system. As we learned in Chapter 16,
antibodies are exquisitely selective proteins that
can bind to a single target among millions of irrelevant 
sites. Researchers have long dreamed of harnessing the 
specificity of antibodies for a variety of uses that re- 
quire the targeting of drugs and other treatments to 
particular sites in the body. It is this use of antibodies 
as targeting devices that led to the concept of the
"magic bullet," a treatment that could effectively seek
and destroy tumor cells and infectious agents wherever
they resided. 

The major limitation in the therapeutic use of antibodies
is producing a useful antibody in large quantities.
Initially, researchers screened myelomas, which
are antibody-secreting tumors, for the production of 



FIGURE 23-4 

Production of tissue plasminogen activator (tPA) by mammalian
cell culture. The cloned cDNA for human tPA was
ligated into an expression vector that contained a strong
promoter and terminator. The vector was stably transferred
into a mammalian cell line. The initial transformants
secreted tPA into the culture medium, but the level of
expression was very low. Cell lines that expressed tPA to
high levels were obtained using methotrexate treatment,
which selects for cells that have amplified the dhfr gene
resident in the vector together with the linked tPA expression
cassette (Chapter 12). High-expressing lines are grown
in large fermentors and recombinant tPA is purified from
the culture medium.



[Figure 23-4 diagram. Labels: tPA signal sequence; coding sequence for mature tPA; cloned cDNA for human tPA; promoter; terminator; expression vector. Steps: cleave with HindIII and EcoRI; ligate to expression vector; introduce into mammalian cells; select stable transformants by growth in HAT medium; cells express low levels of tPA; methotrexate selection; cells express high levels of tPA; culture cells in fermentor; tPA secreted into culture medium.]






[Figure 23-5 diagram. Steps: inoculate with purified Ag; isolate spleen cells; myeloma cells; cell fusion; select hybridoma cells in HAT medium; seed individual cells into wells; culture cells; antibodies are secreted into culture medium; test hybridoma culture medium for MAb that reacts with antigen; propagate positive clones; freeze away a cell stock; isolate MAb from culture medium.]



FIGURE 23-5 

Production of a monoclonal antibody (MAb). A mouse is
inoculated with an antigen (Ag) of interest. This stimulates
the proliferation of lymphocytes expressing antibodies
against the antigen. Lymphocytes are taken from the spleen
and fused to myeloma cells by treatment with polyethylene
glycol. Hybrid cells are selected by growth in HAT medium
(Chapter 12). The myeloma cells lack the enzyme HPRT
and thus die in this medium unless they become fused with
a lymphocyte, which expresses the missing enzyme. Unfused
lymphocyte cells soon die off as well, because they do not
grow for long in culture. Individual hybrid cells are
transferred to the wells of a microtiter dish and cultured
for several days. Aliquots of the culture fluids are removed
and tested for the presence of antibody (Ab) that binds the
antigen. Cells that test positive are cultured for monoclonal
antibody production. Antibody-producing cell lines are
stored frozen in liquid nitrogen (this process is called cell
banking). Aliquots can be thawed out and cultured as needed.
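
The selection logic in this caption can be sketched as a small truth table. The following Python fragment is only an illustration: the `Cell` class, its two boolean traits, and the rule that a hybrid inherits a trait if either parent has it are simplifying assumptions introduced here, not part of the original procedure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    name: str
    has_hprt: bool   # assumed trait: enzyme needed to grow in HAT medium
    immortal: bool   # assumed trait: grows indefinitely in culture


def fuse(a: Cell, b: Cell) -> Cell:
    """Simplified fusion: the hybrid has a trait if either parent has it."""
    return Cell(f"{a.name} x {b.name}",
                a.has_hprt or b.has_hprt,
                a.immortal or b.immortal)


def survives_hat_selection(cell: Cell) -> bool:
    """Only cells that have HPRT and keep dividing persist in HAT medium."""
    return cell.has_hprt and cell.immortal


myeloma = Cell("myeloma", has_hprt=False, immortal=True)
lymphocyte = Cell("lymphocyte", has_hprt=True, immortal=False)
hybridoma = fuse(lymphocyte, myeloma)

for cell in (myeloma, lymphocyte, hybridoma):
    fate = "survives" if survives_hat_selection(cell) else "is lost"
    print(f"{cell.name}: {fate}")
```

Under these assumptions only the fused cell passes both tests, which is why HAT medium leaves hybridomas behind.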



useful antibodies. But they lacked a means to program
a myeloma to produce an antibody to their specifications.
This situation changed dramatically with the
development of monoclonal antibody technology. The
procedure for producing monoclonal antibodies, or
MAbs, is shown in Figure 23-5. First a mouse or rat
is inoculated with the antigen to which an antibody
is desired. After the animal mounts an immune response
to the antigen, its spleen, which houses antibody-producing
cells (lymphocytes), is removed, and
the spleen cells are fused en masse to a specialized
myeloma cell line that no longer produces an antibody
of its own. The resulting fused cells, or hybridomas,
retain properties of both parents. They grow continuously
and rapidly in culture like the myeloma cell,
yet they produce antibodies specified by the lymphocyte
from the immunized animal. Hundreds of hybridomas
can be produced from a single fusion
experiment, and they are systematically screened to
identify those producing large amounts of a desired
antibody. Once identified, this antibody is available in
limitless quantities. Monoclonal antibodies are already
widely used for the diagnosis of infections and cancer
and for the imaging of tumors for radiotherapy. And
investigations into their use in the direct treatment of
cancer, inflammation, and immune disorders are on the
rise.



Human Antibodies That Recognize 
Specific Antigens Can Be Directly 
Cloned and Selected 

One new application of monoclonal antibody tech- 
nology is the generation of abzymes, antibodies that 
behave like enzymes to catalyze a chemical reaction. 






FIGURE 23-6

Direct cloning of antibody cDNAs by PCR. To engineer an
antibody, the amino acid sequence of the variable domain
needs to be determined. This could be done by sequencing
a purified preparation of the heavy- (H) and light- (L) chain
proteins, but a simpler method is to deduce the sequence
from the cloned cDNA. In the past, a cDNA library was
prepared from hybridoma mRNA and screened with probes
from the constant regions of the H and L chain genes. A
simpler method has been developed that uses the PCR.
From a comparison of a large number of antibody
sequences, amino acids frequently found at the amino
termini of antibodies were identified. From this information,
a set of degenerate PCR primers was designed that
corresponds to all the possible sequences in this region. Because
the amino acids in the constant domains of different antibodies
are nearly identical, only one PCR primer is needed
for the 3' end of each H and L chain sequence. To directly
clone the antibody cDNAs, cDNA is prepared by treating
hybridoma mRNA with reverse transcriptase, mixed with a
pair of PCR primers (in this case, for amplifying the
heavy chain sequences), and subjected to PCR. Without
knowledge of the amino terminus of the antibody chain, a
PCR had to be set up with each of the different 5' primers
until an amplified DNA fragment was obtained. The process
can be simplified if the sequence of the first six or
seven amino acids of the antibody can be determined; this
is sufficient to design a single 5' PCR primer.
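
To make the idea of a degenerate primer concrete, the sketch below back-translates a short amino-terminal peptide into every DNA sequence that could encode it, using the standard genetic code. The peptide `EVQLQQ` is a hypothetical example chosen for illustration, and the function names are assumptions of this sketch; the primers actually used in such experiments are not given in the text.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codons_for(amino_acid: str) -> list[str]:
    """All codons that encode one amino acid (one-letter code)."""
    return [codon for codon, aa in CODON_TABLE.items() if aa == amino_acid]


def degeneracy(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the peptide."""
    count = 1
    for aa in peptide:
        count *= len(codons_for(aa))
    return count


def coding_sequences(peptide: str):
    """Yield every sense-strand sequence encoding the peptide."""
    for combo in product(*(codons_for(aa) for aa in peptide)):
        yield "".join(combo)


peptide = "EVQLQQ"  # hypothetical amino-terminal residues
print(peptide, "can be encoded by", degeneracy(peptide), "sequences")
print("first few:", list(coding_sequences(peptide))[:3])
```

A degenerate primer pool is in effect a mixture covering all of these sequences, which is why knowing the first six or seven amino acids is enough to design a single 5' primer.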



[Figure 23-6 diagram. Labels: rearranged heavy chain gene (V, D, J, and C_H segments); heavy chain mRNA; primers. Steps: isolate hybridoma mRNA; synthesize cDNA with reverse transcriptase; anneal degenerate PCR primers to cDNA; PCR; cloned heavy chain.]




Enzymes catalyze reactions by stabilizing a chemical 
structure intermediate between the substrate and 
product, termed the transition state. Thus, if monoclonal 
antibodies could be made to a transition state analogue
(a molecule resembling the transition state of
a chemical reaction), then some of these antibodies
might have catalytic activity. The ability to produce
custom-designed catalysts would be very valuable,
especially to the chemical and pharmaceutical industries.

Initial attempts to produce catalytic antibodies in- 
dicated that they were exceedingly rare and often not 
found among the hybridomas produced by conven- 
tional monoclonal antibody technology. An excellent 
fusion might produce several hundred different an- 
tibodies, but the entire repertoire of antibodies that 
can be produced by the immune system is perhaps 
100 million. How can the entire repertoire be tapped? 

One strategy that shows promise is to bypass the
inefficient fusion step in hybridoma production and
directly clone antibody cDNAs from the lymphocytes
of immunized mice (Figures 23-6 and 23-7). Investigators
inoculated a mouse with an antigen. They







recovered spleen cells from the mouse and used PCR 
to amplify millions of cDNAs for antibody light and 
heavy chains. The light- and heavy-chain cDNAs were 
cloned separately into phage vectors and then recom- 
bined in vitro to generate a third, combinatorial library 
of phage carrying random pairs of light and heavy 
chains. The library was plated onto a bacterial lawn, 
and the resulting phage plaques, each containing a 
unique antibody, were screened with radioactively labeled
antigen in a manner similar to that used for






[Figure 23-7 diagram, part 1. Steps: lymphocytes from immunized mouse; mRNA; reverse transcriptase; cDNA; PCR; amplified heavy chain cDNAs; amplified light chain cDNAs; ligate H and L chain cDNAs into λ expression vector (H chain cDNA, L chain cDNA); package into phage; combinatorial phage library; infect E. coli; H and L chains are produced in infected cells; prepare replica filter; add labeled antigen to filter.]



FIGURE 23-7

Creating a combinatorial library of antibodies expressed in
E. coli. Lymphocytes or spleen cells are removed from an
immunized animal. mRNA is obtained and cDNA synthesized
with reverse transcriptase. The heavy- (H) and light-
(L) chain genes are separately amplified by PCR, as shown
in Figure 23-6, and ligated into λ cloning vectors. Two different
libraries are produced, one containing the H chain
genes and one containing the L chain genes (this step has
been omitted from the figure for simplicity). Phage DNA is
isolated from each library, and the H and L chain sequences
are ligated together and packaged to form a combinatorial
library. Each phage now contains a random pair of H and
L chain cDNAs and thus upon infection of E. coli directs
the expression of the two antibody chains in infected cells.
Since the H chain sequence contains only the variable
region and the first constant domain, the antibody that
forms is called a Fab, for antigen binding fragment. It binds
the antigen much like an intact antibody but it lacks the
effector domain. To identify an antibody that recognizes the
antigen, the phage library is plated, and the antibody (Fab)
molecules present in the plaques are transferred to filters.
The filters are incubated with radioactively labeled antigen
and then washed to remove excess unbound ligand. A
radioactive spot on the autoradiogram identifies a plaque
that contains an antibody that binds the antigen. A recent
procedure uses the phage display technology, described in
Figure 23-10, to select antibodies with desired properties.



[Figure 23-7 diagram, part 2. Steps: H and L chains (heavy chain, light chain) associate to form Fab molecules; Fab binds to filter; Fab binds antigen; isolate phage DNA from master plate; H and L chain-encoding cDNAs from reactive antibody are cloned.]






cloning cDNAs from an expression library (Figure
7-10). Out of a million phage plaques screened, 200
clones were identified that produced an antibody binding
the antigen. Thus, with this approach, investigators
were able to sample a million possible antibodies,
at least a thousand times more than they could screen
by conventional monoclonal antibody technology.
Since phages in a particular plaque encode the antibody
expressed in the plaque, it is a trivial matter to
clone the heavy- and light-chain cDNAs from the
phage DNA. These cDNAs can be placed into bacterial
or mammalian expression vectors for production
of large quantities of the selected antibody.
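
The random pairing that gives a combinatorial library its size can be sketched in a few lines. Everything here except the figures quoted in the text (a million plaques screened, 200 positives) is an assumption made for illustration: the chain names, the pool sizes of 1000, and the helper functions are hypothetical.

```python
import random


def possible_pairs(n_heavy: int, n_light: int) -> int:
    """Number of distinct heavy/light combinations that random pairing can make."""
    return n_heavy * n_light


def combinatorial_library(heavy_chains, light_chains, n_clones, seed=0):
    """Draw n_clones random (heavy, light) pairs, one pair per phage clone."""
    rng = random.Random(seed)
    return [(rng.choice(heavy_chains), rng.choice(light_chains))
            for _ in range(n_clones)]


# Hypothetical pools of amplified chain cDNAs.
heavy = [f"H{i}" for i in range(1, 1001)]
light = [f"L{i}" for i in range(1, 1001)]

library = combinatorial_library(heavy, light, n_clones=5000)
print("possible pairs:", possible_pairs(len(heavy), len(light)))
print("example clones:", library[:3])

# Figures quoted in the text: 200 binders among a million plaques screened.
hit_rate = 200 / 1_000_000
print(f"fraction of plaques that bound antigen: {hit_rate:.2%}")
```

Because each clone pairs chains at random, many pairs will not match any antibody made by the immunized animal, which is one reason a large number of plaques must be screened.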

A recent modification of this method uses filamentous
phages such as M13 instead of λ phage and allows
display of the antibodies on the phage surface. This
offers the advantage of being able to screen thousands
more phage (because the screening can be done in
solution) and to select phage that express tight-binding
antibodies. We will discuss this method later and in
Figure 23-10.



[Figure 23-8 panel titles: Mouse antibody; Chimeric antibody; Humanized antibody]



"Humanized" Monoclonal Antibodies
Retain Activity But Lose
Immunogenicity

Although swift progress is being made in the identi- 
fication of monoclonal antibodies with potential ther- 
apeutic value, their use is limited by a problem we 
have already discussed in this chapter. Monoclonal 
antibodies are usually mouse proteins, and they are 
not identical to human antibodies. Thus, antibodies 
injected into a patient will eventually be recognized 
as foreign proteins and will be cleared from the 
circulation. 

As we learned in Chapter 16, both chains of the
antibody molecule can be divided into variable and
constant regions. The variable regions differ in sequence
from one antibody to another, and this is the
region of the protein that binds the antigen. The constant
region is the same among all antibodies of the
same type. The first method used to reduce the
immunogenicity of a mouse monoclonal antibody was
simply to construct chimeric genes that encoded pro- 
teins in which the variable regions from the mouse 




[Figure 23-8 diagram labels. Mouse antibody: mouse variable and constant regions; mouse CDRs. Chimeric antibody: human constant regions; mouse variable regions, including CDRs. Humanized antibody: human variable framework and constant regions; mouse CDRs only.]



FIGURE 23-8

Antibody engineering. The basic structure of a mouse monoclonal
antibody (MAb) resembles that of a human antibody.
However, there are numerous differences between amino
acid sequences of the antibodies from the two species.
These sequence differences account for the immunogenicity
of mouse MAbs in humans. A chimeric MAb is constructed
by ligating the cDNA fragment encoding the mouse VL and
VH domains to fragments encoding the C domains from a
human antibody. Because the C domains do not contribute
to antigen binding, the chimeric antibody will retain the
same antigen specificity as the original mouse MAb but will
be closer to human antibodies in sequence. Chimeric MAbs
still contain some mouse sequences, however, and may still
be immunogenic. A humanized MAb contains only those
mouse amino acids necessary to recognize the antigen.
This product is constructed by building into a human
antibody the amino acids from the mouse complementarity
determining regions, or CDRs.



antibody were fused to the constant regions from a 
human antibody. The chimeric antibody (Figure 
23-8) retained its binding specificity but more closely 
resembled a natural human antibody. 

This antibody, however, was not fully humanized, 
because it retained amino acid sequences from the 
mouse protein. Thus, scientists have set out to engi- 
neer fully humanized monoclonal antibodies that will 
be indistinguishable from natural molecules. Extensive 
studies of the three-dimensional structures of antibody 
molecules tell us that only a few of the one hundred 
amino acids in the variable region of an antibody 
actually contact the antigen; these regions of contact 
are referred to as complementarity determining regions 
(CDRs). Three CDRs each comprise the antigen-binding
sites on the light and heavy chains. The rest






of the variable region serves as a scaffold to anchor
the CDRs in the correct positions. This breakdown
of amino acids in the variable region into those serving
recognition and those serving structural roles is also
evident from simply comparing the sequences of many
antibody molecules. Amino acid sequences in the
CDRs are hypervariable, whereas the structural, or
framework, amino acids differ little.

Thus, to make a fully humanized antibody, all that
would be required in principle would be to use in
vitro mutagenesis to transfer the CDR amino acid
sequences from a mouse MAb to a natural human
antibody (Figure 23-8). This method was used to humanize
an antibody that recognizes an antigen on the
surface of human lymphocytes. This humanized MAb
is now in clinical trials as an immunosuppressant and
for treatment of lymphoid tumors. Another potentially
valuable MAb binds a growth factor receptor found
in large numbers on the surface of many breast tumor
cells. Laboratory experiments showed that this antibody
could block the growth of these cells in culture
and caused tumors seeded in mice to regress. Unfortunately,
the first humanized versions of this antibody
bound the receptor protein but failed to block the
growth of breast carcinoma cells. Investigators suspected
that the problem was with the framework amino
acids, and they used computer modeling to design
amino acid substitutions that would strengthen the
antibody-antigen interaction. Several such variant antibodies
were produced and tested; one bound the
receptor 250 times more tightly than did the original
antibody and successfully blocked tumor cell growth
in culture. This antibody is now being produced in
large quantities for clinical trials.
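
At the sequence level, the CDR transfer described above amounts to copying a few stretches of residues from the mouse variable region into the human framework. The sketch below shows only that bookkeeping. It assumes the two variable regions are already aligned and of equal length, and the toy sequences and CDR coordinates are invented for illustration; real CDR boundaries come from structural and sequence analysis.

```python
def graft_cdrs(human_variable: str, mouse_variable: str, cdr_ranges) -> str:
    """Return the human sequence with the mouse residues copied in at each CDR.

    cdr_ranges holds (start, end) pairs, 0-based and half-open, and assumes
    the two sequences are aligned position by position.
    """
    if len(human_variable) != len(mouse_variable):
        raise ValueError("sequences must be aligned to the same length")
    grafted = list(human_variable)
    for start, end in cdr_ranges:
        grafted[start:end] = mouse_variable[start:end]
    return "".join(grafted)


# Toy data: 'A' marks human framework residues, 'W' marks mouse residues.
human = "A" * 30
mouse = "W" * 30
toy_cdrs = [(5, 10), (15, 18), (24, 28)]  # three hypothetical CDRs

humanized = graft_cdrs(human, mouse, toy_cdrs)
print(humanized)
```

As the breast tumor example shows, a graft of this kind is only a starting point: framework residues may also need to be changed before the humanized antibody binds as well as the original.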



Protein Engineering Can Tailor 
Antibodies for Specific Applications 

Humanizing monoclonal antibodies is an example of 
the emerging technology of protein engineering, that is, 
a process using recombinant DNA to modify the struc- 
ture of natural proteins to improve or change their 
function. Antibodies are particularly attractive can- 
didates for protein engineering, because their structure 



is understood in great detail and because their potential
for use in medicine is enormous. Another way in
which antibodies are being engineered is by changing
their effector domains, the regions of the heavy chain
that specify antibody function (for example, killing
of cells marked by the antibody). In this way, the mode
of action of a monoclonal antibody can be reprogrammed.
One promising strategy is to replace the
effector domain entirely with a sequence encoding a
toxin. An antibody-toxin fusion protein would deliver
the toxin specifically to cells bearing the target antigen.
This product could be an exceptionally potent treatment
for cancer and for viral diseases such as AIDS.
Antibody engineering is also being used to construct
bispecific antibodies. In these antibodies, each of the two
arms recognizes a different antigen, thus allowing an
antibody to bridge the two antigens. For example, a
bispecific antibody could recognize a tumor cell protein
with one arm and a protein on the surface of a
killer T cell with the other, thereby bringing the killer
cells directly to the tumor (Figure 23-9).



Protein Engineering Is Used to 
Improve a Detergent Enzyme 

Subtilisin is a serine protease produced by bacteria. 
Due to its broad specificity for proteins that commonly 
soil clothing, this enzyme was developed for com- 
mercial use in laundry detergents. (It is subtilisin that 
is prominently advertised as the enzyme additive in 
modern detergents.) But the first detergents containing 
subtilisin suffered from a serious drawback: they could 
not be used with bleach, because bleach inactivates 
the enzyme. Biochemical analysis determined that loss
of activity was due to the oxidation of a methionine 
at position 222. Once this happened, the modified 
enzyme lost 90 percent of its activity. Because they 
knew which amino acid was bleach sensitive, however, 
scientists decided to see whether a variant of subtilisin 
could be produced that was no longer sensitive to 
bleach. 

To do this, site-directed mutants were constructed
in the gene encoding subtilisin. The strategy was simply
to substitute, one at a time, each of the non-wild-type
amino acids at residue 222. The mutant genes




FIGURE 23-9

A bispecific antibody. By using recombinant DNA, the
cDNAs for antibodies to two different antigens can be engineered
to make an antibody in which each arm recognizes a
different antigen. Thus it is possible to recombine antibodies
to a surface antigen on tumor cells and to a protein on cytotoxic
T cells to make a bispecific antibody that brings the
two cells together to facilitate killing of the tumor cells.



were cloned into expression vectors and the 19 different
subtilisin derivatives were expressed. Biochemical
analysis showed that the cysteine-222 enzyme was
even more active than the wild-type protein, but it
was also inactivated by bleach. The next most active
variant was the alanine-substituted enzyme, which was
53 percent as active as wild-type subtilisin. This variant
exhibited no detectable bleach sensitivity, so detergents
containing this engineered subtilisin can now
be used with bleach. This new variant of subtilisin is
an example of a second-generation molecule, a molecule
specifically engineered for a new desirable trait. Protein
engineers are currently at work on a third-generation
molecule that exhibits decreased temperature
sensitivity so that it can be used in hot water.
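
The substitution strategy used at residue 222 (try each of the 19 non-wild-type amino acids in turn) is easy to express as a short routine. The sketch below works on a placeholder sequence: only the methionine at position 222 reflects the text, and the surrounding residues and the sequence length are invented so that the example runs on its own.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def single_site_variants(sequence: str, position: int) -> dict[str, str]:
    """Return all 19 substitutions at a 1-based position.

    Keys use the usual shorthand, e.g. 'M222A' for Met-222 changed to Ala.
    """
    index = position - 1
    wild_type = sequence[index]
    return {
        f"{wild_type}{position}{aa}": sequence[:index] + aa + sequence[index + 1:]
        for aa in AMINO_ACIDS
        if aa != wild_type
    }


# Placeholder protein: glycine everywhere except Met at position 222.
toy_protein = "G" * 221 + "M" + "G" * 53

variants = single_site_variants(toy_protein, 222)
print(len(variants), "variants:", ", ".join(sorted(variants)))
```

Each variant would then be expressed and assayed, as was done for the cysteine and alanine substitutions described above.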

This experiment points out the power of recombinant
DNA as a tool for the engineering of natural
products. Changing the properties of a protein was all
but impossible prior to the development of recombinant
DNA techniques. Now it is not only possible,
but easy. It is a routine exercise for protein engineers
to generate hundreds of variants of a natural protein
for testing. These changes [...]
based on detailed knowledge of the structure of a
protein; alternatively [...]
section, a combination of structural information,
random mutagenesis, and a powerful selection for
improved protein function can have [...]



Growth Hormone Variants 
with Improved Binding Are 
Selected by Phage Display 

To engineer an improved subtilisin enzyme, researchers
were aided by the knowledge that only one specific
amino acid had to be changed. Thus, they could systematically
vary that amino acid to find the one that
worked the best. But more complex challenges face
protein engineers. Is it possible, for example, to engineer
antibodies with higher affinity for antigen; to
design an inhibitor that tightly binds to and blocks a
cell-surface protein or an enzyme inside a cell; to
generate a growth factor or hormone with increased
affinity for its receptor? Alterations of this sort require
several amino acid changes, and with 20 possible amino
acids at each position, the number of variants that




[Figure 23-10 diagram. Labels: hGH variants; wild type; promoter; mutated codon of receptor-binding helix; gene III; phage display vector; fusion protein on phage surface (hGH, gene III protein); hGH receptor. Steps: ligate; transform E. coli; phage mutants; bind phage to immobilized hGH receptor; weakly binding phage pass through column; elute bound phage; rescreen to enrich for tight-binding phage; clone individual phage; isolate phage DNA; determine sequence of tight-binding mutant (binds more tightly than wild type).]



need to be screened is enormous (for changes at just
3 amino acids, there are 8000 different combinations;
for 10 amino acids, 10^13 different proteins are possible).
Clearly, these variants cannot be made and tested one
at a time, and a method for direct selection of improved
proteins is needed.
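
The arithmetic behind these figures is simply 20 raised to the number of positions varied. A short check, with a function name chosen here for illustration:

```python
AMINO_ACID_CHOICES = 20  # standard amino acids allowed at each position


def variant_count(positions: int) -> int:
    """Size of the sequence space when every listed position is fully randomized."""
    return AMINO_ACID_CHOICES ** positions


for k in (1, 3, 10):
    print(f"{k:2d} randomized positions -> {variant_count(k):.3g} variants")
```

Three positions give exactly 8000 combinations, and ten positions give roughly 10^13, far more than could be built and assayed individually.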

Researchers have used a new approach to select 
variants of human growth hormone with increased 
affinity for growth hormone receptor (Figure 23-10). 



FIGURE 23-10

Expression of proteins and peptides on the surface of
filamentous phage. A library of randomly mutated hGH
cDNAs was ligated into an M13-based phagemid vector so
that hGH was fused to the carboxy-terminal domain of the
M13 gene III protein. The carboxy terminus of the gene III
protein associates with the phage particle, and the amino
terminus, containing the hGH variants, is displayed on the
outer surface of the phage. The library of phagemids is
introduced into E. coli, and ampicillin-resistant colonies are
obtained. These E. coli are then infected with a helper phage
that induces the production of phagemid particles. Only
1-10 percent of the phage particles contain an hGH-gene III
fusion protein, and these contain only one hGH fusion
molecule per phage. This ensures that the phage retain
sufficient wild-type gene III protein in their coats to
remain infectious. hGH-phage were passed through a
column containing the hGH receptor covalently linked to
plastic beads. Only the phage expressing hGH were retained.
The nonbinding phage lacking hGH passed through
the column. The bound phage were isolated, cultured in
E. coli, and passed again over the column. Repeated rounds
of selection resulted in the identification of hGH variants
that bound the receptor with exceptionally high affinity.



From structural studies and extensive mutagenesis of
hGH, they knew what portions of the amino acid
sequence were important for receptor binding. They
synthesized degenerate oligonucleotides that encoded
all possible amino acids at these positions and ligated
the pool of oligonucleotides in place of the natural
hGH sequence. The resulting pool of variant hGH
cDNAs was fused to the reading frame of gene III in
the filamentous phage M13. Gene III encodes a minor
phage coat protein expressed on the surface of the
phage, and incorporation of the hGH cDNA into this
gene results in the display of the hGH variants on the
surface of the phage, one variant per phage. This technique
is known as phage display.

Now it was a simple matter to pass this library of 
more than 10 n different phage over a column con- 
taining the hGH receptor. Phage displaying weakly 
binding hGH variants were washed off the column, 
and phage displaying tightly binding variants were
recovered with a more stringent wash. This population
of tight-binding phage was amplified by infection of
E. coli and passed over the column a second time. The
selection was repeated for a total of six rounds, each
round enriching for the phage displaying hGH variants 






with highest affinity for the receptor bound to the
column. At this point, individual phage were cloned,
the affinities of their hGH variants were measured
directly, and the sequences of the hGH cDNAs were
examined. Among these variants was one that bound
its receptor about 10 times more tightly than natural
hGH did. When selected amino acids from another
region of hGH that had been randomized were introduced
into this variant, the resulting hGH molecule
bound to the hGH receptor over 50 times more tightly
than the wild-type hGH did. This process is being
repeated in the hope of obtaining even more tightly binding
variants.
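
Why repeated rounds of binding and amplification are so effective can be seen from a simple enrichment model. The sketch below is a deliberately crude illustration: the two classes of phage, the starting fraction of tight binders, and the per-round retention probabilities are all hypothetical numbers, not measurements from the hGH experiment.

```python
def enrich_by_panning(fractions: dict, retention: dict, rounds: int) -> list:
    """Track population fractions over rounds of column binding and regrowth.

    fractions: starting fraction of each phage class (sums to 1).
    retention: assumed probability that a phage of that class stays on the column.
    Amplification in E. coli is modeled as renormalizing what was retained.
    """
    history = [dict(fractions)]
    for _ in range(rounds):
        kept = {name: frac * retention[name] for name, frac in fractions.items()}
        total = sum(kept.values())
        fractions = {name: amount / total for name, amount in kept.items()}
        history.append(fractions)
    return history


# Hypothetical library: one tight binder per million phage.
start = {"tight": 1e-6, "weak": 1 - 1e-6}
retained = {"tight": 0.5, "weak": 0.01}

history = enrich_by_panning(start, retained, rounds=6)
for round_number, state in enumerate(history):
    print(f"round {round_number}: tight binders = {state['tight']:.6f}")
```

With these assumed values each round multiplies the odds of finding a tight binder fifty-fold, so a variant that starts as one phage in a million dominates the population after six rounds.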

The ability afforded by techniques such as phage 
display to correlate protein structure and function in 
a systematic way makes possible new methods of find- 
ing novel drugs. If researchers have a good idea what 
combination of amino acids gives the best fit to the 
binding site on a receptor, the next step in rational 
drug design would be to design, or even select, a small 
peptide that binds as well as the larger protein. And 
then, using computer modeling to display the molecular
contacts between ligand and receptor, researchers
can attempt to design and synthesize small nonprotein 
molecules that make the same contacts. The end- 
product would be a small organic molecule that could 
be produced more cheaply than a recombinant protein, 
yet would retain the full biological activity of the 
protein hormone. And, more important, such mole- 
cules could be administered orally, thus eliminating 
the major disadvantage of most recombinant protein 
therapeutics— that they must be delivered directly 
into the bloodstream by injection. This type of rational 
drug design contrasts sharply with the conventional 
approach to drug discovery now in use in the pharmaceutical
industry, in which an inventory of completely
unrelated compounds is tested at random until
an active compound is found. 






New Technologies Promise New
Approaches to Drug Design

The biotechnology industry is in its infancy, and its
successes to date follow directly from developments
in molecular biology that are already nearly two decades
old. The recombinant drugs currently in clinical
use arise from what is by now conventional technology:
gene cloning, expression, and mutagenesis to
improve protein function. These methods will continue
to turn out new drugs such as erythropoietins
to treat anemia caused by kidney disease, DNase to
treat cystic fibrosis, or colony-stimulating factors
(CSFs) to increase white blood cell production during
chemotherapy.

But the true promise of biotechnology is in novel 
technologies that are only now being developed. We 
have mentioned efforts to design catalytic antibodies 
that can accelerate chemical reactions in both medical 
and industrial applications. This is but one example 
of a whole new approach to protein engineering in 
which novel activities can be placed on unrelated pro- 
tein scaffolds, using random mutagenesis coupled with 
selection methods like phage display. Similar goals 
may be achieved by the engineering of ribozymes, RNA
molecules with catalytic activity, and the use of the 
polymerase chain reaction to select nucleic acid mol- 
ecules that bind tightly to targets of medical impor- 
tance. Another strategy that may see widespread 
application is treatment with antisense DNA and RNA 
to inhibit the expression of oncogenes in tumors or of 
viral genes in infected patients. And a variety of new 
technologies based on viral vectors promise new ap- 
proaches for vaccines and gene therapy. 

Many of these techniques now work in the test 
tube, and the principal challenge facing biotechnology 
companies is to turn these laboratory techniques into 
commercially viable processes. 



General 



Hall, S. S. Invisible Frontiers: The Race to Synthesize a Human
Gene. Atlantic Monthly Press, New York, 1987.



Hood, L. "Biotechnology and medicine of the future." J.
Am. Med. Assoc., 259: 1837-1844 (1988).






Goeddel, D. V. (ed.) Systems for Heterologous Gene
Expression. Meth. Enzymol., Vol. 185, Academic Press,
New York, 1990.

Original Research Papers 

EXPRESSION OF HUMAN PROTEINS IN E. COLI

Itakura, K., T. Hirose, R. Crea, A. Riggs, H. L. Heyneker,
F. Bolivar, and H. Boyer. "Expression in E. coli of
a chemically synthesized gene for the hormone
somatostatin." Science, 198: 1056-1063 (1977).

Goeddel, D. V., H. L. Heyneker, T. Hozumi, R. Arentzen,
K. Itakura, D. G. Yansura, M. J. Ross, G. Miozzari, R.
Crea, and P. Seeburg. "Direct expression in Escherichia
coli of a DNA sequence coding for human growth
hormone." Nature, 281: 544-548 (1979).

Goeddel, D. V., D. G. Kleid, F. Bolivar, H. L. Heyneker,
D. G. Yansura, R. Crea, T. Hirose, A. Kraszewski, K.
Itakura, and A. D. Riggs. "Expression of chemically
synthesized genes for human insulin." Proc. Natl. Acad.
Sci. USA, 76: 106-110 (1979).

EXPRESSION IN YEAST 

Valenzuela, P., A. Medina, W. J. Rutter, G. Ammerer, and
B. D. Hall. "Synthesis and assembly of hepatitis B virus
surface antigen particles in yeast." Nature, 298: 347-350
(1982).

Hitzeman, R. A., D. W. Leung, L. J. Perry, W. J. Kohr, H.
L. Levine, and D. V. Goeddel. "Secretion of human
interferons by yeast." Science, 219: 620-625 (1983).

Sabin, E. A., C. T. Lee-Ng, J. R. Shuster, and P. J. Barr.
"High-level expression and in vivo processing of
chimeric ubiquitin fusion proteins in Saccharomyces
cerevisiae." Bio/Tech., 7: 705-709 (1989).

EXPRESSION IN INSECT CELLS 

Luckow, V. A., and M. D. Summers. "Trends in the
development of baculovirus expression vectors."
Bio/Tech., 6: 47-55 (1988). [Review]

Medin, J. A., L. Hunt, K. Gathy, R. K. Evans, and M. S.
Coleman. "Efficient, low-cost protein factories:
expression of human adenosine deaminase in
baculovirus-infected insect larvae." Proc. Natl. Acad.
Sci. USA, 87: 2760-2764 (1990).

EXPRESSION OF PROTEINS IN MAMMALIAN CELLS 

Gorman, C. M. "Mammalian cell expression." Curr. Opin.
Biotech., 1: 36-43 (1990). [Review]

Pennica, D., W. E. Holmes, W. J. Kohr, R. N. Harkins, G.
A. Vehar, C. A. Ward, W. F. Bennett, E. Yelverton, P.
H. Seeburg, H. L. Heyneker, D. V. Goeddel, and D.
Collen. "Cloning and expression of human tissue-type
plasminogen activator cDNA in E. coli." Nature, 301:
214-221 (1983).

Paborsky, L. R., B. M. Fendly, K. L. Fisher, R. M. Lawn,
B. J. Marks, G. McCray, K. M. Tate, G. A. Vehar, and
C. M. Gorman. "Mammalian cell transient expression
of tissue factor for the production of antigen." Prot. Eng.,
3: 547-553 (1990).

VACCINES 

Brown, F. "From Jenner to genes: the new vaccines."
Lancet, 335: 587-590 (1990). [Review]

Bolognesi, D. P. "Approaches to HIV vaccine design."
Trends Biotech., 8: 40-45 (1990). [Review]

Berman, P. W., T. J. Gregory, L. Riddle, G. R. Nakamura,
M. A. Champe, J. P. Porter, F. M. Wurm, R. D.
Hershberg, E. K. Cobb, and J. W. Eichberg. "Protection of
chimpanzees from infection by HIV-1 after vaccination
with recombinant glycoprotein gp120 but not gp160."
Nature, 345: 622-625 (1990).

PROTEIN ENGINEERING RECOMBINANT PRODUCTS 

Estell, D. A., T. P. Graycar, and J. A. Wells. "Engineering
an enzyme by site-directed mutagenesis to be resistant
to chemical oxidation." J. Biol. Chem., 260: 6518-6521
(1985).

Desjarlais, R. L., G. L. Seibel, I. D. Kuntz, P. S. Furth, J.
C. Alvarez, P. R. Ortiz de Montellano, D. L. DeCamp,
L. M. Babe, and C. S. Craik. "Structure-based design
of nonpeptide inhibitors specific for the human
immunodeficiency virus 1 protease." Proc. Natl. Acad. Sci.
USA, 87: 6644-6648 (1990).

Abrahmsen, L., J. Tom, J. Burnier, K. A. Butcher, A.
Kossiakoff, and J. A. Wells. "Engineering subtilisin and its
substrates for efficient ligation of peptide bonds in
aqueous solution." Biochemistry, 30: 4151-4159 (1991).

Bennett, W. F., N. F. Paoni, B. A. Keyt, D. Botstein, A. J.
S. Jones, L. Presta, F. M. Wurm, and M. J. Zoller. "High
resolution analysis of functional determinants on
human tissue-type plasminogen activator." J. Biol. Chem.,
266: 5191-5201 (1991).

Cunningham, B. C., and J. A. Wells. "Rational design of
receptor-specific variants of human growth hormone."
Proc. Natl. Acad. Sci. USA, 88: 3407-3411 (1991).

MONOCLONAL ANTIBODIES 

Milstein, C. "Monoclonal antibodies." Sci. Am., 243: 66-74
(1980). [Review]

CLONING AND EXPRESSION OF ANTIBODIES 

Pluckthun, A. "Antibodies from Escherichia coli." Nature, 347:
497-498 (1990). [Review]



Huse, W. D., L. Sastry, S. A. Iverson, A. S. Kang, M.
Alting-Mees, D. R. Burton, S. J. Benkovic, and R. A. Lerner.
"Generation of a large combinatorial library of the
immunoglobulin repertoire in phage lambda." Science,
246: 1275-1281 (1989).

Chaudhary, V. K., J. K. Batra, M. G. Gallo, M. C.
Willingham, D. J. FitzGerald, and I. Pastan. "A rapid
method of cloning functional variable-region antibody
genes in Escherichia coli as single-chain immunotoxins."
Proc. Natl. Acad. Sci. USA, 87: 1066-1070 (1990).

Mullinax, R. L., et al. "Identification of human antibody
fragment clones specific for tetanus toxoid in a
bacteriophage λ immunoexpression library." Proc. Natl.
Acad. Sci. USA, 87: 8095-8099 (1990).

Berg, J., E. Lotscher, K. S. Steimer, D. J. Capon, J. Baenziger,
H. M. Jack, and M. Wabl. "Bispecific antibodies that
mediate killing of cells infected with human
immunodeficiency virus of any strain." Proc. Natl. Acad. Sci.
USA, 88: 4723-4727 (1991).

Wood, C. R., A. J. Dorner, G. E. Morris, E. M. Alderman,
D. Wilson, R. M. J. O'Hara, and R. J. Kaufman. "High
level synthesis of immunoglobulins in Chinese hamster
ovary cells." J. Immunol., 145: 3011-3016 (1990).

CATALYTIC ANTIBODIES 

Jacobs, J. W. "New perspective on catalytic antibodies."
Bio/Tech., 9: 258-262 (1991). [Review]

Benkovic, S. J., J. A. Adams, C. R. Borders, K. D. Janda, and
R. A. Lerner. "The enzymic nature of antibody
catalysis: development of multistep kinetic processing."
Science, 250: 1135-1139 (1990).

Bowdish, K., Y. Tang, J. B. Hicks, and D. Hilvert. "Yeast
expression of a catalytic antibody with chorismate
mutase activity." J. Biol. Chem., 266: 11901-11908 (1991).

HUMANIZING ANTIBODIES 

Winter, G., and C. Milstein. "Man-made antibodies." Nature,
349: 293-299 (1991). [Review]

Jones, P. T., P. H. Dear, J. Foote, M. S. Neuberger, and G.
Winter. "Replacing the complementarity-determining
regions in a human antibody with those from a mouse."
Nature, 321: 522-525 (1986).

Riechmann, L., M. Clark, H. Waldmann, and G. Winter.
"Reshaping human antibodies for therapy." Nature, 332:
323-327 (1988).

Carter, P., L. Presta, C. M. Gorman, J. B. Ridgway, D.
Henner, W. L. T. Wong, A. M. Rowland, C. Kotts, M.
E. Carver, and H. M. Shepard. "Humanization of an
anti-p185HER2 antibody for human cancer therapy."
Proc. Natl. Acad. Sci. USA, 89: In press (1992).



PHAGE DISPLAY 

Smith, G. P. "Filamentous fusion phage: novel expression
vectors that display cloned antigens on the virion
surface." Science, 228: 1315-1317 (1985).

Cwirla, S. E., E. A. Peters, R. W. Barrett, and W. J. Dower.
"Peptides on phage: a vast library of peptides for
identifying ligands." Proc. Natl. Acad. Sci. USA, 87: 6378-6382
(1990).

Clackson, T., H. R. Hoogenboom, A. D. Griffiths, and G.
Winter. "Making antibody fragments using phage
display libraries." Nature, 352: 624-628 (1991).

Lowman, H. B., S. Bass, N. Simpson, and J. A. Wells.
"Selecting high affinity binding proteins by monovalent
phage display." Biochemistry, 30: 10832-10838 (1991).

NEW RECOMBINANT DRUGS 

Shak, S., D. J. Capon, R. Hellmiss, S. A. Marsters, and C.
L. Baker. "Recombinant human DNase I reduces the
viscosity of cystic fibrosis sputum." Proc. Natl. Acad. Sci.
USA, 87: 9188-9192 (1990).

Takaue, Y., et al. "Effects of recombinant human G-CSF,
GM-CSF, IL-3, and IL-1 alpha on the growth of
purified human peripheral blood progenitors." Blood, 76:
330-335 (1990).

Shepard, H. M., G. D. Lewis, J. C. Sarup, B. M. Fendly, D.
Maneval, J. Mordenti, I. Figari, C. E. Kotts, M. A.
Palladino, A. Ullrich, and D. Slamon. "Monoclonal
antibody therapy of human cancer: taking the HER2
protooncogene to the clinic." J. Clin. Immunol., 11: 117-127
(1991).

Watson, S. R., C. Fennie, and L. A. Lasky. "Neutrophil 
influx into an inflammatory site inhibited by a soluble 
homing receptor-IgG chimaera." Nature, 349: 164-167 
(1991). 

FUTURE TRENDS 

Sarver, N., E. M. Cantin, P.-S. Chang, J. A. Zaia, P. A. Ladne,
D. A. Stephens, and J. J. Rossi. "Ribozymes as potential
anti-HIV-1 therapeutic agents." Science, 247: 1222-1225
(1990).

Tuerk, C., and L. Gold. "Systematic evolution of ligands
by exponential enrichment: RNA ligands to
bacteriophage T4 DNA polymerase." Science, 249: 505-510
(1990).

Han, L., J. S. Yun, and T. E. Wagner. "Inhibition of Moloney
murine leukemia virus-induced leukemia in transgenic
mice expressing antisense RNA complementary to the
retroviral packaging sequences." Proc. Natl. Acad. Sci.
USA, 88: 4313-4317 (1991).

Miller, P. S. "Oligonucleoside methylphosphonates as
antisense reagents." Bio/Tech., 9: 358-362 (1991).

Schultz, J. S. "Biosensors." Sci. Am., 265: 64-69 (1991).



